METHODS UTILIZING NOVEL TARGET GENES RELATED
TO IMMUNE-MEDIATED DISEASES

FIELD OF THE INVENTION 
The present invention provides methods utilizing novel target genes related to 
immune-mediated diseases, such as asthma, allergy and autoimmune diseases. The 
invention is based on a molecular level description of the polarization of CD4+ precursor 
cells (Thp) from which T helper cells are known to originate. T helper cell subtypes (Th1
and Th2) have an important role in the immune system. However, many pathological
processes, such as allergies, are associated with the presence of T helper cells at the site of 
inflammation. 

BACKGROUND 

T helper cell subtypes Th1 and Th2 arise from a common precursor cell in
response to triggering through the T cell receptor and cytokine receptors for IL-12 or IL-4.
This leads to activation of complex signaling pathways. Disturbances in the balance
between type 1 and type 2 responses can lead to certain immune-mediated diseases such as
asthma, allergy and certain autoimmune diseases (1-3). Thus, it is important to understand
how Th1 and Th2 cells are generated.

Th1 and Th2 cells are known to originate from naive CD4+ precursor cells (Thp)
after antigenic activation through the T cell receptor (TCR) and co-stimulatory molecules
in a suitable cytokine milieu. The main cytokines orchestrating Th1 and Th2 development
are IL-12 and IL-4, respectively. Triggering of the TCR and cytokine signaling leads to
activation of complex, and to a large extent poorly understood, downstream signaling
networks that finally result in maturation of the effector Th1 and Th2 cells (4, 5). IL-12
(and, in humans, also IFNα) induces the Th1 type response by activating the Signal
Transducer and Activator of Transcription 4 (STAT4) mediated signaling pathway (6-8).
Some other cytokines, such as IFNγ and IL-18, can also promote Th1 responses, especially
in combination with IL-12. Th2 differentiation is induced by IL-4 through the STAT6
signaling pathway (9-12). GATA binding protein 3 (GATA-3) and avian musculoaponeurotic
fibrosarcoma (v-maf) AS42 oncogene homolog (c-maf), and T-box expressed in T cells
(T-bet), are also among the most important factors regulating the early polarization of
Th2 and Th1 cells, respectively (11-16).

Another important cytokine involved in Th1 and Th2 differentiation is TGFβ.
This immunosuppressive cytokine exhibits pleiotropic activities in various cellular
processes and, importantly, can suppress the differentiation of CD4+ cells into the Th1 and
Th2 subtypes (17). However, as with IL-12 and IL-4, the target genes and details of
TGFβ downstream signaling are not clear.

To provide a basis for understanding the mechanism of action and molecular
networks involved in the signaling of these cytokines, the early phase leading to
polarization of Th1 and Th2 cells in the presence and absence of TGFβ was examined. As
a result, genes differentially regulated in the cells induced to polarize to Th1 and Th2
subtypes in human were identified. Importantly, to our knowledge 77 of these genes have
not been previously described to be involved in Th1 and Th2 cell differentiation. In
addition, we have further clarified which of the genes involved in the early polarization of
human Th1 and Th2 cells are targets of IL-12 and IL-4 regulation and which of them are
also targets of immunosuppressive TGFβ. Wild type mouse cells or STAT6-knockout cells
were used to further clarify the mechanism by which IL-4 regulates gene expression through
STAT6 signaling. These newly identified genes involved in Th1 and Th2
differentiation serve as therapeutic targets in achieving an appropriate balance between
Th1 and Th2 responses.

SUMMARY 

Certain methods of the invention are related to a method of identifying a
compound capable of modulating the polarization of CD4+ lymphocytes. The method
includes the steps of contacting the compound with naive CD4+ lymphocytes, and then
inducing the polarization of the cells. Further, a gene expression profile from the
lymphocytes is prepared during the polarization, and the profile is compared to a baseline
gene expression profile of CD4+ lymphocyte polarization as established in Table 1. A
difference in the expression profiles of the target genes identifies a potential drug
compound for the treatment of asthma or other immune-mediated diseases.

The invention is also related to methods of identifying a compound that
modulates the expression or activity of at least one target gene listed in Table 2 or Table 6.
The methods include the steps of (a) incubating a cell that can express a protein from said
gene or a cell that has said activity with a compound under conditions and for a time
sufficient for the cell to express the protein or activity of said gene when the compound is
not present; (b) incubating a control cell under the same conditions and for the same time
without the compound; (c) measuring expression or activity of said gene in the cell in the
presence of the compound; (d) measuring expression or activity of said gene in the control
cell; and (e) comparing the amount of expression or activity of said gene in the presence
and absence of the compound, wherein a difference in the level of expression or activity
indicates that the compound modulates the expression or activity of said gene.

Other methods are provided for identifying a compound that modulates
differentiation of a lymphocyte. These methods generally involve contacting a test cell
capable of expressing one or more gene markers listed in Table 2 or Table 6 with a test
compound. The expression level of the one or more gene markers in the test cell is
determined. The expression level of these gene markers is then compared with the
expression levels for these same markers in a control cell. In these methods, the test cell 
and the control cell are lymphocytes and the cellular state of the control cell is known. A 
difference in the expression level between the test and control cell is an indicator that the 
test compound is a modulator of lymphocyte differentiation. 

Another embodiment of the invention relates to a method of treating a patient 
with asthma or other immune-mediated disease. The method of treatment comprises 
administering to the patient a pharmaceutical composition that alters the expression or 
activity of at least one gene listed in Table 2 or Table 6. In a preferred embodiment of the 
invention, the active compound of said pharmaceutical composition is identified by a 
method of the invention. 

Methods for classifying a lymphocyte or assessing the cellular state of a 
lymphocytic cell are also provided. Certain of these methods involve providing a test 
sample derived from the lymphocyte, wherein the lymphocyte is capable of expressing one 
or more nucleic acid markers from the group consisting of those listed in one or more of 
the tables (e.g., Table 1 or Table 2 and/or Table 6). The expression level of the one or
more markers in the test sample is determined and compared with the expression level of
the same markers in a control sample. The control sample is derived from a lymphocytic
cell whose cellular status is known. The lymphocyte is then classified on the basis of this
comparison. In some instances, the methods involve classifying the lymphocyte as being a
Th1 or Th2 type cell.
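
The comparison step can be pictured as a nearest-reference classification. The following is a minimal Python sketch, assuming each sample is summarized as log-scale expression values for a shared set of markers and assuming Pearson correlation as the similarity measure. Neither the similarity measure nor the numeric values come from this specification; the marker names are used only as labels, and the function names are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("a profile with no variation cannot be correlated")
    return cov / (sd_x * sd_y)


def classify(test_profile, control_profiles):
    """Return the label of the best-correlated control and all scores."""
    markers = sorted(test_profile)
    test_values = [test_profile[m] for m in markers]
    scores = {
        label: pearson(test_values, [profile[m] for m in markers])
        for label, profile in control_profiles.items()
    }
    return max(scores, key=scores.get), scores


# Invented control profiles from cells whose state is known (illustration only).
controls = {
    "Th1": {"IFNg": 8.0, "T-bet": 7.0, "GATA-3": 2.0},
    "Th2": {"IFNg": 2.0, "T-bet": 2.5, "GATA-3": 8.0},
}
test_sample = {"IFNg": 7.2, "T-bet": 6.1, "GATA-3": 3.0}
label, scores = classify(test_sample, controls)
```

With more markers, or with replicate control samples, a different distance or a statistical test may be more appropriate; the sketch only shows the shape of the comparison.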

A variety of methods for diagnosing the presence of, or a predisposition to, an
immune-related disease are provided as well. These methods generally involve
determining the expression level of one or more nucleic acid markers in a test sample
obtained from a subject. These markers are selected from the group consisting of those
listed in one or more of the tables (e.g., Table 1 or Table 2 and/or Table 6). The expression
level of the one or more nucleic acid markers in the test sample is compared with the
expression level of the same markers in a control sample whose immune status is known.
The presence or absence of the immune disorder in the subject, or a predisposition to the
immune disorder, is then diagnosed on the basis of this comparison.



A BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1. Expression of Th1 and Th2 marker genes. CD4+ cells were purified
from human cord blood and were activated with plate-bound αCD3 (1000 ng/µl for
coating) and 500 ng/µl soluble αCD28. The cells were further polarized with either 2.5
ng/ml of IL-12 for Th1 conditions or 10 ng/ml of IL-4 for Th2 conditions in the presence
and absence of 3 ng/ml TGFβ. Part of the activated cells were cultured in "neutral
conditions" without any polarizing cytokines. The cells were collected at the time points
of 0 h and 48 h. RNA was isolated from the samples and cDNA was prepared. Expression
of the known marker genes IFNγ, T-bet and GATA-3 was measured from the samples using
Real-Time RT-PCR to confirm the polarization of the cells in the Th1 and Th2 directions.
The figure shows representative data from one of two repeated experiments.

FIGS. 2A-2F. The target genes of activation, IL-12, IL-4 and TGFβ. The CD4+
cells were purified from human cord blood and were activated with plate-bound αCD3
(1000 ng/µl for coating) and 500 ng/µl soluble αCD28. The cells were further polarized
with either 2.5 ng/ml of IL-12 for Th1 conditions or 10 ng/ml of IL-4 for Th2 conditions in
the presence and absence of 3 ng/ml TGFβ. Part of the activated cells were cultured in
"neutral conditions" without any polarizing cytokines. The samples were collected at the
time points of 0 h and 48 h. The cRNAs were prepared for oligonucleotide microarray
hybridizations and the data were analyzed with the MAS5 program. To identify the target
genes of different treatments, the expression profiles of the samples were compared to each
other: FIG. 2A shows target genes of activation; FIG. 2B shows target genes of IL-12;
FIG. 2C shows target genes of IL-4; FIG. 2D shows target genes of TGFβ in Th1
conditions; FIG. 2E shows target genes of TGFβ in Th2 conditions; and FIG. 2F shows
genes differentially expressed by Th1- and Th2-induced cells. The tonal intensities in the
figures indicate the differences (signal log ratio) between two treatments. All
irreproducible results and changes below 2-fold (signal log ratio <1) were excluded from the
results. A higher cut-off (signal log ratio >4) was used for the target genes of activation to
reduce the number of genes. The functional groups of the genes are represented as
numbers (1. Cell surface molecules, 2. Cytokines, chemokines and other ligands, 3.
Enzymes and pathway molecules, 4. Structural molecules and intracellular trafficking, 5.
Transcriptional regulation, 6. Unclassified).
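
The cut-offs in the FIG. 2 description are stated as signal log ratios (SLR). The SLR is a base-2 logarithm, so the fold change is 2 raised to the SLR and an SLR of 1 corresponds to a 2-fold change. The following minimal Python sketch shows such a filter. The gene identifiers and values are invented for illustration, and two details are assumptions because the text does not settle them: the cut-off is applied to the absolute SLR (so decreases are treated like increases), and the cut-off is a parameter whose boundary handling (strict or inclusive) the caller chooses.

```python
def fold_change(slr: float) -> float:
    """Convert a signal log ratio (log2 scale) into a signed fold change."""
    return 2.0 ** slr if slr >= 0 else -(2.0 ** -slr)


def filter_targets(slr_by_gene, cutoff=1.0, strict=False):
    """Keep genes whose absolute SLR passes the cut-off.

    strict=False keeps |SLR| >= cutoff (changes below the cut-off are dropped);
    strict=True keeps only |SLR| > cutoff. Using the absolute value is an
    assumption of this sketch.
    """
    if strict:
        return {g: s for g, s in slr_by_gene.items() if abs(s) > cutoff}
    return {g: s for g, s in slr_by_gene.items() if abs(s) >= cutoff}


# Invented example values (not data from the specification).
example = {"gene_A": 1.6, "gene_B": -0.4, "gene_C": -2.0, "gene_D": 4.3}
cytokine_targets = filter_targets(example, cutoff=1.0)
activation_targets = filter_targets(example, cutoff=4.0, strict=True)
```

The reproducibility requirement mentioned in the text would be a separate step, applied across replicate experiments before or after this filter.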

FIG. 3. Validation of oligonucleotide microarray results with Real-Time RT-PCR.
For validation of the oligonucleotide array results, long term Th1 and Th2 primary
cultures were generated from four individuals as previously described. The priming was
performed in the presence of 100 ng/ml PHA (Murex Diagnostics, Chatillon, France) and
irradiated CD32-B7 transfected fibroblasts. Th1 cultures were supplemented with 2.5
ng/ml of IL-12 (R&D Systems, Minneapolis, MN). Th2 cultures were supplemented with
10 ng/ml of anti-IL-12 (R&D Systems) and 10 ng/ml of IL-4 (R&D Systems). After 48
hours of priming, 40 U/ml of IL-2 (R&D Systems) was added into the cultures to enhance
the proliferation of the lymphocytes. Part of the cells were cultured without any polarizing
cytokines in the presence of IL-2 alone. During polarization, samples were collected at
time points 0 h, 6 h, 24 h, 48 h or 7 d. Real-time quantitative RT-PCR was performed to
quantitate the gene expression levels of SATB1, TIP3, DUSP6, E4BP4 and GADD45β.

DETAILED DESCRIPTION 

I. Definitions 

The terms "nucleic acid," "polynucleotide" and "oligonucleotide" are used
interchangeably and refer to a deoxyribonucleotide or ribonucleotide polymer in either
single- or double-stranded form, and unless otherwise limited, encompass known
analogues of natural nucleotides that hybridize to nucleic acids in a manner similar to
naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid
sequence includes the complementary sequence thereof.

The term "target nucleic acid" refers to a nucleic acid (often derived from a
biological sample), to which a polynucleotide probe is designed to specifically hybridize.
It is either the presence or absence of the target nucleic acid that is to be detected, or the
amount of the target nucleic acid that is to be quantified. The target nucleic acid has a
sequence that is complementary to the nucleic acid sequence of the corresponding probe
directed to the target. The term target nucleic acid can refer to the specific subsequence of
a larger nucleic acid to which the probe is directed or to the overall sequence (e.g., gene or
mRNA) whose expression level it is desired to detect.

A "probe" or "polynucleotide probe" is a nucleic acid capable of binding to a
target nucleic acid of complementary sequence through one or more types of chemical
bonds, usually through complementary base pairing, usually through hydrogen bond
formation, thus forming a duplex structure. The probe binds or hybridizes to a "probe
binding site." A probe can include natural (i.e., A, G, C, or T) or modified bases
(7-deazaguanosine, inosine, etc.). A probe can be an oligonucleotide which is a
single-stranded DNA. Polynucleotide probes can be synthesized or produced from naturally
occurring polynucleotides. In addition, the bases in a probe can be joined by a linkage
other than a phosphodiester bond, so long as it does not interfere with hybridization. Thus,
probes can include, for example, peptide nucleic acids in which the constituent bases are
joined by peptide bonds rather than phosphodiester linkages (see, e.g., Nielsen et al.,
Science 254, 1497-1500 (1991)). Some probes can have leading and/or trailing sequences
of noncomplementarity flanking a region of complementarity.

A "perfectly matched probe" has a sequence perfectly complementary to a
particular target sequence. The probe is typically perfectly complementary to a portion
(subsequence) of a target sequence. The term "mismatch probe" refers to a probe whose
sequence is deliberately selected not to be perfectly complementary to a particular target
sequence.

A "primer" is a single-stranded oligonucleotide capable of acting as a point of
initiation of template-directed DNA synthesis under appropriate conditions (i.e., in the
presence of four different nucleoside triphosphates and an agent for polymerization, such
as DNA or RNA polymerase or reverse transcriptase) in an appropriate buffer and at a
suitable temperature. The appropriate length of a primer depends on the intended use of
the primer but typically ranges from 15 to 30 nucleotides, although shorter or longer
primers can be used as well. Short primer molecules generally require cooler temperatures
to form sufficiently stable hybrid complexes with the template. A primer need not reflect
the exact sequence of the template but must be sufficiently complementary to hybridize
with a template. The term "primer site" refers to the area of the target DNA to which a
primer hybridizes. The term "primer pair" means a set of primers including a 5' "upstream
primer" that hybridizes with the 5' end of the DNA sequence to be amplified and a 3'
"downstream primer" that hybridizes with the complement of the 3' end of the sequence to
be amplified.
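
For a known template, a primer pair can be read off the two ends of the region to be amplified. The Python sketch below is illustrative only. It assumes the region is supplied as the 5'-to-3' sense strand, writes the upstream primer as the first bases of that strand, and writes the downstream primer as the reverse complement of its last bases, which is the usual way a reverse primer is specified. The default length of 18 is an arbitrary choice within the typical 15 to 30 nucleotide range, and the function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def primer_pair(region: str, length: int = 18):
    """Return (upstream, downstream) primers for the given region.

    Both primers are returned 5' to 3'. No melting temperature, secondary
    structure or specificity checks are made in this sketch.
    """
    region = region.upper()
    if len(region) < 2 * length:
        raise ValueError("region is too short for two non-overlapping primers")
    upstream = region[:length]
    downstream = reverse_complement(region[-length:])
    return upstream, downstream


# Made-up sequence, short primer length chosen only to keep the example readable.
upstream, downstream = primer_pair("ATGCCGTAAGCTTGGA", length=5)
```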

The term "complementary" means that one nucleic acid is identical to, or 
hybridizes selectively to, another nucleic acid molecule. Selectivity of hybridization exists 
when hybridization occurs that is more selective than total lack of specificity. Typically,
selective hybridization will occur when there is at least about 55% identity over a stretch of
at least 14-25 nucleotides, preferably at least 65%, more preferably at least 75%, and most
preferably at least 90%. Preferably, one nucleic acid hybridizes specifically to the other
nucleic acid. See M. Kanehisa, Nucleic Acids Res. 12:203 (1984).

The terms "polypeptide," "peptide" and "protein" are used interchangeably to 
refer to a polymer of amino acid residues. The term also applies to amino acid polymers in 
which one or more amino acids are chemical analogues of a corresponding naturally
occurring amino acid.

A "subsequence" or "segment" refers to a sequence of nucleotides or amino acids
that comprises a part of a longer sequence of nucleotides or amino acids (e.g., a
polypeptide), respectively.

The term "operably linked" refers to functional linkage between a nucleic acid
expression control sequence (such as a promoter, signal sequence, or array of transcription
factor binding sites) and a second polynucleotide, wherein the expression control sequence
affects transcription and/or translation of the second polynucleotide.

The terms "identical" or percent "identity," in the context of two or more nucleic
acids or polypeptides, refer to two or more sequences or subsequences that are the same or
have a specified percentage of nucleotides or amino acid residues that are the same, when
compared and aligned for maximum correspondence, as measured using a sequence
comparison algorithm, for example one of those described below, or by visual inspection.

The phrase "substantially identical," in the context of two nucleic acids or
polypeptides, refers to two or more sequences or subsequences that have at least 75%,
preferably at least 85%, more preferably at least 90%, 95% or higher nucleotide or amino
acid residue identity, when compared and aligned for maximum correspondence, as
measured using a sequence comparison algorithm, for example one of those described
below, or by visual inspection. Preferably, the substantial identity exists over a region of
the sequences that is at least about 30 residues in length, preferably over a region longer
than 50 residues, more preferably at least about 70 residues, and most preferably the
sequences are substantially identical over the full length of the sequences being compared,
such as the coding region of a nucleotide sequence. For sequence comparison, typically
one sequence acts as a reference sequence, to which test sequences are compared. When
using a sequence comparison algorithm, test and reference sequences are input into a
computer, subsequence coordinates are designated, if necessary, and sequence algorithm
program parameters are designated. The sequence comparison algorithm then calculates
the percent sequence identity for the test sequence(s) relative to the reference sequence,
based on the designated program parameters.
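
Once two sequences have been aligned, percent identity is a simple count. The Python sketch below assumes the alignment is supplied as two equal-length strings with "-" marking gaps, and it assumes identity is computed over all aligned columns. Other conventions, such as dividing by the length of the shorter sequence, give different numbers, and this specification does not fix one. The function name is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both rows.

    Gap characters ('-') never count as matches. The denominator is the
    total number of alignment columns (an assumption of this sketch).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Made-up alignment: 7 identical columns out of 9.
identity = percent_identity("ACGT-ACGT", "ACGTTACCT")
```

A value computed this way can then be compared against the thresholds given above (for example 75%, 85% or 90%).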

Optimal alignment of sequences for comparison can be conducted, e.g., by the
local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the
homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443 (1970), by
the search for similarity method of Pearson & Lipman, Proc. Natl. Acad. Sci. USA
85:2444 (1988), by computerized implementations of these algorithms (GAP, BESTFIT,
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer
Group, 575 Science Dr., Madison, WI), or by visual inspection (see, e.g., Current
Protocols in Molecular Biology (Ausubel et al., 1995 supplement)).
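
As an illustration of the first algorithm cited, the following Python sketch computes a Smith-Waterman local alignment score by dynamic programming. The scoring values (match +2, mismatch -1, linear gap penalty -2) are assumptions chosen for the example; the packaged implementations named above use their own scoring schemes and gap models. Only the score is returned, not the alignment itself.

```python
def smith_waterman_score(a: str, b: str, match: int = 2,
                         mismatch: int = -1, gap: int = -2) -> int:
    """Best local alignment score between a and b (linear gap penalty).

    Each cell holds the best score of any local alignment ending at that
    pair of positions; scores are floored at zero so that an alignment can
    start anywhere in either sequence.
    """
    previous = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        current = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diagonal = previous[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            current[j] = max(0, diagonal, previous[j] + gap, current[j - 1] + gap)
            best = max(best, current[j])
        previous = current
    return best
```

Keeping only two rows of the matrix is enough to obtain the score; recovering the aligned residues would require storing the full matrix and tracing back from the best cell.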

One useful algorithm for conducting sequence comparisons is PILEUP. PILEUP
uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol.
Evol. 25:351-360 (1987). The method used is similar to the method described by Higgins
& Sharp, CABIOS 5:151-153 (1989). Using PILEUP, a reference sequence is compared to
other test sequences to determine the percent sequence identity relationship using the
following parameters: default gap weight (3.00), default gap length weight (0.10), and
weighted end gaps. PILEUP can be obtained from the GCG sequence analysis software
package, e.g., version 7.0 (Devereux et al., Nuc. Acids Res. 12:387-395 (1984)).

Other examples of algorithms that are suitable for determining percent sequence
identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are
described in Altschul et al., J. Mol. Biol. 215:403-410 (1990). Software for performing
BLAST analyses is publicly available through the National Center for Biotechnology
Information (http://www.ncbi.nlm.nih.gov/). The algorithm involves first identifying high
scoring sequence pairs (HSPs) by identifying short words of length W in the query
sequence, which either match or satisfy some positive-valued threshold score T when
aligned with a word of the same length in a database sequence. T is referred to as the
neighborhood word score threshold (Altschul et al., supra). These initial neighborhood
word hits act as seeds for initiating searches to find longer HSPs containing them. The
word hits are then extended in both directions along each sequence for as far as the
cumulative alignment score can be increased. Cumulative scores are calculated using, for
nucleotide sequences, the parameters M (reward score for a pair of matching residues;
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the
word hits in each direction is halted when: the cumulative alignment score falls off by the
quantity X from its maximum achieved value; the cumulative score goes to zero or below,
due to the accumulation of one or more negative-scoring residue alignments; or the end of
either sequence is reached.
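
The extension rule can be sketched for the ungapped, nucleotide case. The Python example below extends a seed to the right only, scores matches with M and mismatches with N, and stops on the three conditions listed above. The values M=5 and N=-4 are the BLASTN defaults quoted in this description; X=20, the function name and the sequences are assumptions for illustration. It is a simplified sketch, not the BLAST implementation.

```python
def extend_right(query: str, subject: str, q_start: int, s_start: int,
                 seed_len: int, M: int = 5, N: int = -4, X: int = 20):
    """Ungapped rightward extension of an exact seed.

    Returns (best_score, extension_length), where extension_length is the
    number of positions past the seed at which the best score was reached.
    """
    score = best = seed_len * M          # assumes the seed matches exactly
    best_len = 0
    steps = 0
    i, j = q_start + seed_len, s_start + seed_len
    while i < len(query) and j < len(subject):   # stop at end of either sequence
        score += M if query[i] == subject[j] else N
        steps += 1
        if score > best:
            best, best_len = score, steps
        if best - score >= X or score <= 0:      # X-drop, or score at zero or below
            break
        i += 1
        j += 1
    return best, best_len


# Made-up sequences: the seed ACGT is followed by four matches, then mismatches.
result = extend_right("ACGTACGTAAAA", "ACGTACGTCCCC", 0, 0, seed_len=4)
```

A full implementation would run the same loop leftward from the seed and combine the two halves into one HSP.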

For identifying whether a nucleic acid or polypeptide is within the scope of the
invention, the default parameters of the BLAST programs are suitable. The BLASTN
program (for nucleotide sequences) uses as defaults a word length (W) of 11, an
expectation (E) of 10, M=5, N=-4, and a comparison of both strands. For amino acid
sequences, the BLASTP program uses as defaults a word length (W) of 3, an expectation
(E) of 10, and the BLOSUM62 scoring matrix. The TBLASTN program (using protein
sequence for nucleotide sequence) uses as defaults a word length (W) of 3, an expectation
(E) of 10, and a BLOSUM62 scoring matrix. (See, e.g., Henikoff & Henikoff, Proc. Natl.
Acad. Sci. USA 89:10915 (1992)).

Another indication that two nucleic acid sequences are substantially identical is
that the two molecules hybridize to each other under stringent conditions. "Bind(s)
substantially" refers to complementary hybridization between a probe nucleic acid and a
target nucleic acid and embraces minor mismatches that can be accommodated by reducing
the stringency of the hybridization media to achieve the desired detection of the target
polynucleotide sequence. The phrase "hybridizing specifically to" refers to the binding,
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under
stringent conditions when that sequence is present in a complex mixture (e.g., total
cellular DNA or RNA).

The term "stringent conditions" refers to conditions under which a probe will
hybridize to its target subsequence, but to no other sequences. Stringent conditions are
sequence-dependent and will be different in different circumstances. Longer sequences
hybridize specifically at higher temperatures. Generally, stringent conditions are selected
to be about 5 °C lower than the thermal melting point (Tm) for the specific sequence at a
defined ionic strength and pH. The Tm is the temperature (under defined ionic strength,
pH, and nucleic acid concentration) at which 50% of the probes complementary to the
target sequence hybridize to the target sequence at equilibrium. (As the target sequences
are generally present in excess, at Tm, 50% of the probes are occupied at equilibrium.)
Typically, stringent conditions will be those in which the salt concentration is less than
about 1.0 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at
pH 7.0 to 8.3 and the temperature is at least about 30 °C for short probes (e.g., 10 to 50
nucleotides) and at least about 60 °C for long probes (e.g., greater than 50 nucleotides).
Stringent conditions can also be achieved with the addition of destabilizing agents such as
formamide.
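
This specification does not give a formula for Tm. Purely as an illustration of the "about 5 °C below Tm" guideline, the Python sketch below estimates Tm with the Wallace rule (2 °C per A or T, 4 °C per G or C), a rough rule of thumb for short oligonucleotides under high-salt conditions, and subtracts the offset. The rule ignores the ionic strength, pH and concentration dependence noted above, so the result is only a starting point; the function names are hypothetical.

```python
def wallace_tm(oligo: str) -> int:
    """Rule-of-thumb Tm in degrees C: 2 * (A + T) + 4 * (G + C)."""
    seq = oligo.upper()
    if not seq or set(seq) - set("ACGT"):
        raise ValueError("expected a non-empty DNA sequence of A, C, G, T")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def stringent_temperature(oligo: str, offset: float = 5.0) -> float:
    """Temperature 'offset' degrees C below the estimated Tm."""
    return wallace_tm(oligo) - offset


# Made-up 16-mer with eight A/T and eight G/C bases.
probe = "ACGTACGTACGTACGT"
tm_estimate = wallace_tm(probe)
hybridization_temp = stringent_temperature(probe)
```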

A further indication that two nucleic acid sequences or polypeptides are
substantially identical is that the polypeptide encoded by the first nucleic acid is
immunologically cross reactive with the polypeptide encoded by the second nucleic acid,
as described below. The phrases "specifically binds to a protein" or "specifically
immunoreactive with," when referring to an antibody, refer to a binding reaction which is
determinative of the presence of the protein in the presence of a heterogeneous population
of proteins and other biologics. Thus, under designated immunoassay conditions, a
specified antibody binds preferentially to a particular protein and does not bind in a
significant amount to other proteins present in the sample. Specific binding to a protein
under such conditions requires an antibody that is selected for its specificity for a particular
protein. A variety of immunoassay formats may be used to select antibodies specifically
immunoreactive with a particular protein. For example, solid-phase ELISA immunoassays
are routinely used to select monoclonal antibodies specifically immunoreactive with a
protein. See, e.g., Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold Spring
Harbor Publications, New York, for a description of immunoassay formats and conditions
that can be used to determine specific immunoreactivity.

"Conservatively modified variations" of a particular polynucleotide sequence
refers to those polynucleotides that encode identical or essentially identical amino acid
sequences, or where the polynucleotide does not encode an amino acid sequence, to
essentially identical sequences. Because of the degeneracy of the genetic code, a large
number of functionally identical nucleic acids encode any given polypeptide. For instance,
the codons CGU, CGC, CGA, CGG, AGA, and AGG all encode the amino acid arginine.
Thus, at every position where an arginine is specified by a codon, the codon can be altered
to any of the corresponding codons described without altering the encoded polypeptide.
Such nucleic acid variations are "silent variations," which are one species of
"conservatively modified variations." Every polynucleotide sequence described herein
which encodes a polypeptide also describes every possible silent variation, except where
otherwise noted. One of skill will recognize that each codon in a nucleic acid (except
AUG, which is ordinarily the only codon for methionine) can be modified to yield a
functionally identical molecule by standard techniques. Accordingly, each "silent
variation" of a nucleic acid which encodes a polypeptide is implicit in each described
sequence.
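
The number of silent variations implied by a coding sequence follows directly from codon degeneracy. The Python sketch below enumerates every RNA coding sequence for a short peptide. The codon table is deliberately restricted to three amino acids to keep the example small: arginine (the six codons listed above), methionine (AUG) and tryptophan (UGG, from the standard genetic code). The function name and the example peptide are illustrative.

```python
from itertools import product

# Partial codon table (RNA alphabet); only the residues used in this example.
CODONS = {
    "R": ["CGU", "CGC", "CGA", "CGG", "AGA", "AGG"],  # arginine
    "M": ["AUG"],                                      # methionine
    "W": ["UGG"],                                      # tryptophan
}


def silent_variants(peptide: str):
    """List every coding sequence that encodes the given peptide."""
    choices = [CODONS[residue] for residue in peptide]
    return ["".join(codons) for codons in product(*choices)]


# Met-Arg-Arg: one choice for Met and six for each Arg.
variants = silent_variants("MRR")
```

The count grows multiplicatively with each degenerate position, which is why the text treats silent variations as implicit in a described sequence instead of listing them.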

A polypeptide is typically substantially identical to a second polypeptide, for
example, where the two peptides differ only by conservative substitutions. A
"conservative substitution," when describing a protein, refers to a change in the amino acid
composition of the protein that does not substantially alter the protein's activity. Thus,
"conservatively modified variations" of a particular amino acid sequence refers to amino
acid substitutions of those amino acids that are not critical for protein activity or
substitution of amino acids with other amino acids having similar properties (e.g., acidic,
basic, positively or negatively charged, polar or non-polar, etc.) such that the substitutions
of even critical amino acids do not substantially alter activity. Conservative substitution
tables providing functionally similar amino acids are well-known in the art. See, e.g.,
Creighton (1984) Proteins, W.H. Freeman and Company. In addition, individual
substitutions, deletions or additions which alter, add or delete a single amino acid or a
small percentage of amino acids in an encoded sequence are also "conservatively modified
variations."

The term "naturally occurring" as applied to an object refers to the fact that an
object can be found in nature. For example, a polypeptide or polynucleotide sequence that
is present in an organism that can be isolated from a source in nature and which has not
been intentionally modified by humans in the laboratory is naturally occurring.

The term "antibody" refers to a protein consisting of one or more polypeptides
substantially encoded by immunoglobulin genes or fragments of immunoglobulin genes.
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta,
epsilon and mu constant region genes, as well as myriad immunoglobulin variable region
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified
as gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes,
IgG, IgM, IgA, IgD and IgE, respectively.

A typical immunoglobulin (antibody) structural unit comprises a tetramer. Each
tetramer is composed of two identical pairs of polypeptide chains, each pair having one
"light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each
chain defines a variable region of about 100 to 110 or more amino acids primarily
responsible for antigen recognition. The terms variable light chain (VL) and variable
heavy chain (VH) refer to these light and heavy chains respectively.

Antibodies exist as intact immunoglobulins or as a number of well-characterized
fragments produced by digestion with various peptidases. Thus, for example, pepsin
digests an antibody below the disulfide linkages in the hinge region to produce F(ab')2, a
dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond. The
F(ab')2 may be reduced under mild conditions to break the disulfide linkage in the hinge
region thereby converting the F(ab')2 dimer into an Fab' monomer. The Fab' monomer is
essentially an Fab with part of the hinge region (see Fundamental Immunology, W.E. Paul,
ed., Raven Press, N.Y. (1993), for a more detailed description of other antibody
fragments). While various antibody fragments are defined in terms of the digestion of an
intact antibody, one of skill will appreciate that such Fab' fragments may be synthesized de
novo either chemically or by utilizing recombinant DNA methodology. Thus, the term
antibody, as used herein, also includes antibody fragments either produced by the
modification of whole antibodies or synthesized de novo using recombinant DNA
methodologies. Preferred antibodies include single chain antibodies, more preferably
single chain Fv (scFv) antibodies in which a variable heavy and a variable light chain are
joined together (directly or through a peptide linker) to form a continuous polypeptide.

A single chain Fv ("scFv") polypeptide is a covalently linked VH::VL
heterodimer which may be expressed from a nucleic acid including VH- and VL-encoding
sequences either joined directly or joined by a peptide-encoding linker. Huston, et al.
Proc. Nat. Acad. Sci. USA, 85:5879-5883 (1988). A number of structures are known for
converting the naturally aggregated, but chemically separated, light and heavy polypeptide
chains from an antibody V region into an scFv molecule which will fold into a three
dimensional structure substantially similar to the structure of an antigen-binding site.
See, e.g., U.S. Patent Nos. 5,091,513; 5,132,405; and 4,956,778.

An "antigen-binding site" or "binding portion" refers to the part of an
immunoglobulin molecule that participates in antigen binding. The antigen binding site is
formed by amino acid residues of the N-terminal variable ("V") regions of the heavy ("H")
and light ("L") chains. Three highly divergent stretches within the V regions of the heavy
and light chains are referred to as "hypervariable regions" which are interposed between
more conserved flanking stretches known as "framework regions" or "FRs". Thus, the
term "FR" refers to amino acid sequences that are naturally found between and adjacent to
hypervariable regions in immunoglobulins. In an antibody molecule, the three
hypervariable regions of a light chain and the three hypervariable regions of a heavy chain
are disposed relative to each other in three dimensional space to form an antigen binding
"surface". This surface mediates recognition and binding of the target antigen. The three
hypervariable regions of each of the heavy and light chains are referred to as
"complementarity determining regions" or "CDRs" and are characterized, for example, by
Kabat et al., Sequences of Proteins of Immunological Interest, 4th ed., U.S. Dept. Health
and Human Services, Public Health Services, Bethesda, MD (1987).

The term "antigenic determinant" refers to the particular chemical group of a 
molecule that confers antigenic specificity. 

The term "epitope" generally refers to that portion of an antigen that interacts
with an antibody. More specifically, the term epitope includes any protein determinant
capable of specific binding to an immunoglobulin or T-cell receptor. Specific binding
exists when the dissociation constant for antibody binding to an antigen is < 1 µM,
preferably < 100 nM and most preferably < 1 nM. Epitopic determinants usually consist of
chemically active surface groupings of molecules such as amino acids and typically have
specific three dimensional structural characteristics, as well as specific charge
characteristics.
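
By way of illustration only, the dissociation-constant thresholds recited above can be
expressed as a simple classification. The following Python sketch is not part of the
disclosed methods; the function name `binding_tier`, the tier labels, and the treatment of
a value exactly at a threshold as falling outside that tier are assumptions made for the
example.

```python
def binding_tier(kd_molar: float) -> str:
    """Classify a dissociation constant (mol/L) against the recited thresholds.

    Thresholds follow the text: < 1 uM (specific), < 100 nM (preferred),
    < 1 nM (most preferred). Labels are illustrative only.
    """
    if kd_molar <= 0:
        raise ValueError("dissociation constant must be positive")
    if kd_molar < 1e-9:      # below 1 nM
        return "most preferred"
    if kd_molar < 1e-7:      # below 100 nM
        return "preferred"
    if kd_molar < 1e-6:      # below 1 uM
        return "specific"
    return "not specific"


if __name__ == "__main__":
    for kd in (5e-10, 5e-8, 5e-7, 1e-5):
        print(f"Kd = {kd:.1e} M -> {binding_tier(kd)}")
```

A lower dissociation constant corresponds to tighter binding, so each successive tier is
a subset of the one before it.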

The term "specific binding" (and equivalent phrases) refers to the ability of a
binding moiety (e.g., a receptor, antibody, ligand or antiligand) to bind preferentially to a
particular target molecule (e.g., ligand or antigen) in the presence of a heterogeneous
population of proteins and other biologics (i.e., without significant binding to other
components present in a test sample). Typically, specific binding between two entities,
such as a ligand and a receptor, means a binding affinity of at least about 10^6 M^-1, and
preferably at least about 10^7, 10^8, 10^9, or 10^10 M^-1.

A "subject" generally refers to an organism from which lymphocytes can be
obtained. Usually the subject is a mammal. The mammal can be a primate (e.g., a human,
monkey, ape, or chimpanzee), or a non-primate (e.g., a mouse).

II. Overview 

Many biological functions are controlled through changes in the expression of
various genes by transcriptional (e.g., through control of initiation, RNA processing, etc.)
and/or translational control. For example, fundamental biological processes such as cell
cycle, cell differentiation and cell death are often characterized by variations in the
expression levels of groups of genes (see, e.g., WO02059271). Changes in gene
expression are also associated with pathogenesis. Thus, changes in the expression levels of
particular genes can indicate the presence and progression of various diseases.

According to the invention, genes that are differentially expressed during
cytokine induced CD4+ lymphocyte polarization in both the presence and absence of
TGFβ have been discovered. One or more of these target genes can be used as part of an
"expression profile" that is representative of a particular state of a lymphocyte.
Identification of these new target genes enables immune-mediated diseases to be analyzed
more reliably. These results also provide new insights into T cell differentiation and reveal
new potential target genes for the therapy of diseases such as asthma. These differentially
expressed genes and their corresponding proteins can also be utilized as "markers" that
characterize particular cellular states for lymphocytes.

The differentially expressed genes that have been identified can be utilized in a
variety of methods for classifying lymphocytes, as well as diagnosing and treating
immune-mediated diseases (e.g., asthma, allergic responses and autoimmune diseases).
Kits and devices including one or more of the differentially expressed genes, proteins
encoded by these genes and/or antibodies that bind the proteins are also provided.

For example, the differentially expressed genes can be used in screening
methods to identify compounds that modulate the expression or activity of the
differentially expressed genes. Such methods can be utilized, for example, for the
identification of compounds that can treat symptoms of disorders related to expression of
proteins encoded by the differentially expressed genes. In addition, the invention
encompasses methods for treating immune-mediated diseases or disorders by
administering compounds and/or other substances that modulate the activity of one or more
of the target genes or target gene products. Such compounds and other substances can
effect the modulation either on the level of target gene expression or target protein activity.
Certain classification methods that are also provided involve determining the level of one
or more of the differentially expressed genes to determine whether a lymphocyte has been
polarized in the Th1 or Th2 direction.

III. Differentially Expressed Genes 

As described more fully in the examples below, an initial set of experiments was
conducted to identify the gene expression profiles of CD4+ cells induced in the Th1 and
Th2 directions in the presence and absence of TGFβ. This allowed those genes involved in
early polarization to be identified, as well as providing insight regarding which genes are
involved in the immunosuppressive effect of TGFβ. Another set of experiments was then
conducted to identify those genes in lymphocytes that are regulated by the cytokines IL-12
and IL-4. Most of the differentially expressed genes were identified using oligonucleotide
arrays; the differential expression of certain genes was confirmed using real-time PCR
approaches.

The differentially expressed genes include, for instance, those identified under the
following sets of conditions or states:

(a) CD4+ cells activated by contact with CD3/CD28 versus unactivated CD4+
cells; these genes correspond to target genes of activation (see, e.g., FIG. 2A);

(b) activated CD4+ cells further activated with IL-12 versus activated CD4+
cells; these genes are representative of Th1 cells and correspond to target genes of IL-12
(see, e.g., FIG. 2B);

(c) activated CD4+ cells further activated with IL-4 versus activated CD4+
cells; these genes are representative of Th2 cells and correspond to target genes of IL-4
(see, e.g., FIG. 2C);

(d) activated cells polarized with IL-12 exposed to TGFβ versus similar cells not
exposed to TGFβ; these correspond to genes in Th1 cells that are affected by TGFβ (see,
e.g., FIG. 2D);

(e) activated cells polarized with IL-4 exposed to TGFβ versus similar cells not
exposed to TGFβ; these correspond to genes in Th2 cells that are affected by TGFβ (see,
e.g., FIG. 2E); and

(f) genes differentially expressed in Th1 cells versus Th2 cells (see, e.g., FIG.
2F).

As discussed in greater detail below, knowledge of the nucleic acids that are
up-regulated or down-regulated in the various types of lymphocytes and in different cellular
states provides the basis for a number of different screening, treatment and diagnostic
methods, in addition to devices to carry out these methods. The differentially expressed
genes include both "fingerprint genes" and "control genes." "Fingerprint genes" are those
nucleic acids that correlate with a particular type of lymphocyte (e.g., Th1 or Th2), or a
particular cellular state (e.g., activated or non-activated). As described in greater detail
below, fingerprint genes can be used in the development of a variety of different screening
and diagnostic methods to classify types of lymphocytes and/or to aid in the diagnosis of
particular disease conditions. A "control gene" is one that encodes a protein that is
involved in a lymphocyte assuming a particular state or becoming a particular type of cell.
Because of the role such genes play, control genes are useful targets for the development of
compound discovery programs and pharmaceutical development such as described infra.
In some instances, a fingerprint gene can be a control gene and vice versa.

Expression levels for combinations of differentially expressed genes, in particular
fingerprint genes, can be used to develop "expression profiles" that are characteristic of a
particular type of lymphocyte or cellular state. An expression profile, as used herein, refers
to the pattern of gene expression corresponding to at least one differentially expressed gene,
but typically includes a plurality of genes. For instance, an expression profile can include
at least 1, 2, 3, 4 or 5 differentially expressed genes, but in other instances can include at
least 6, 7, 8, 9, 10, 12, 14, 16, 18, 20, 25, 30, 35, 40, 45 or 50 or more differentially
expressed genes. In some instances, expression profiles include all of the differentially
expressed genes known for a particular type of lymphocyte or cellular state. So, for
example, certain expression profiles include a measure (quantitative or qualitative) of the
expression level for each of the differentially expressed genes in one or more of the tables
or figures (e.g., Table 1 or Table 2 and/or Table 6).

The pattern of expression associated with gene expression profiles can be defined
in several ways. For example, a gene expression profile can be the absolute (e.g., a
measured value) or relative transcript level of any number of particular differentially
expressed genes. In other instances, a gene expression profile can be defined by
comparing the level of expression of a variety of genes in one state to the level of
expression of the same genes in another state (e.g., activated versus unactivated), or
between one cell type and another cell type (e.g., Th1 versus Th2).
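
By way of illustration only, a relative expression profile of the kind just described can
be sketched as a per-gene log2 ratio between two states. The Python example below is a
minimal sketch and not the method used in the examples; the function name
`relative_profile`, the gene identifiers `gene_A` and `gene_B`, the sample values and the
pseudocount of 1.0 are all hypothetical choices made for the illustration.

```python
import math


def relative_profile(state_a, state_b, pseudocount=1.0):
    """Return the log2 ratio (state_a / state_b) for each gene measured in both states.

    The pseudocount (an assumption of this sketch) avoids division by zero
    for genes with no detectable transcript in one state.
    """
    if pseudocount <= 0:
        raise ValueError("pseudocount must be positive")
    return {
        gene: math.log2((state_a[gene] + pseudocount) / (state_b[gene] + pseudocount))
        for gene in state_a
        if gene in state_b
    }


if __name__ == "__main__":
    # Hypothetical transcript levels in arbitrary units.
    activated = {"gene_A": 7.0, "gene_B": 1.0}
    unactivated = {"gene_A": 1.0, "gene_B": 7.0}
    for gene, ratio in relative_profile(activated, unactivated).items():
        direction = "up" if ratio > 0 else "down"
        print(f"{gene}: log2 ratio {ratio:+.2f} ({direction}-regulated)")
```

A positive value indicates higher expression in the first state; a negative value
indicates higher expression in the second.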

As used herein, the term "differentially expressed gene" or "differentially
expressed nucleic acid" refers to the specific sequence as set forth in the particular
GenBank entry that is provided herein (see, e.g., the tables and figures). The term,
however, is also intended to include more broadly naturally occurring sequences (including
allelic variants of those listed for the GenBank entries), as well as synthetic and
intentionally manipulated sequences (e.g., nucleic acids subjected to site-directed
mutagenesis). It is noted that the sequences of the target genes listed in the tables and
figures are available in the public databases. The tables provide the accession number and
name for each of the sequences. The sequences of the genes in GenBank are herein
expressly incorporated by reference in their entirety as of the filing date of this application
(see www.ncbi.nlm.nih.gov).

Differentially expressed nucleic acids also include sequences that are
complementary to the listed sequences, as well as degenerate sequences resulting from the
degeneracy of the genetic code. Thus, the differentially expressed nucleic acids include:
(a) nucleic acids having sequences corresponding to the sequences as provided in the listed
GenBank accession number; (b) nucleic acids that encode amino acids encoded by the
nucleic acids of (a); (c) a nucleic acid that hybridizes under stringent conditions to a
complement of the nucleic acid of (a); and (d) nucleic acids that hybridize under stringent
conditions to, and therefore are complements of, the nucleic acids described in (a) through
(c). The differentially expressed nucleic acids of the invention also include: (a) a
deoxyribonucleotide sequence complementary to the full-length nucleotide sequences
corresponding to the listed GenBank accession numbers; (b) a ribonucleotide sequence
complementary to the full-length sequence corresponding to the listed GenBank accession
numbers; and (c) a nucleotide sequence complementary to the deoxyribonucleotide
sequence of (a) and the ribonucleotide sequence of (b). The differentially expressed
nucleic acids further include fragments of the foregoing sequences. For example, nucleic
acids including 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 40, 50, 60, 70, 80, 90, 100, 125,
150, 175, 200, 225, 250, 275 or 300 contiguous nucleotides (or any number of nucleotides
therebetween) from a differentially expressed nucleic acid are included. Such fragments
are useful, for example, as primers and probes for hybridizing full-length differentially
expressed nucleic acids (e.g., in detecting and amplifying such sequences).
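
By way of illustration only, the complementary sequences and contiguous fragments referred
to above can be derived from a DNA sequence as sketched below in Python. The function names
`reverse_complement` and `contiguous_fragments` and the 12-nucleotide example sequence are
hypothetical; the sequence is not taken from any GenBank entry cited herein, and the sketch
handles only the unambiguous bases A, C, G and T.

```python
# Translation table for Watson-Crick pairing of unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand of a DNA sequence, read 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def contiguous_fragments(seq: str, length: int) -> list:
    """Return every fragment of `length` contiguous nucleotides from `seq`."""
    if length < 1:
        raise ValueError("fragment length must be at least 1")
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


if __name__ == "__main__":
    example = "ATGCATGCATGC"  # hypothetical sequence for illustration
    print(reverse_complement(example))
    for fragment in contiguous_fragments(example, 10):
        print(fragment)
```

A sequence of N nucleotides yields N - L + 1 fragments of length L, so candidate probes
or primers of any of the recited lengths can be enumerated in the same way.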

In some instances, the differentially expressed nucleic acids include
conservatively modified variations. Thus, for example, in some instances, the
differentially expressed nucleic acids are modified. One of skill will recognize many ways
of generating alterations in a given nucleic acid construct. Such well-known methods
include site-directed mutagenesis, PCR amplification using degenerate polynucleotides,
exposure of cells containing the nucleic acid to mutagenic agents or radiation, and chemical
synthesis of a desired polynucleotide (e.g., in conjunction with ligation and/or cloning to
generate large nucleic acids). See, e.g., Gillam and Smith (1979) Gene 8:81-97; Roberts
et al. (1987) Nature 328: 731-734. When the differentially expressed nucleic acids are
incorporated into vectors, the nucleic acids can be combined with other sequences
including, but not limited to, promoters, polyadenylation signals, restriction enzyme sites
and multiple cloning sites. Thus, the overall length of the nucleic acid can vary
considerably.

Certain differentially expressed nucleic acids of the invention include
polynucleotides that are substantially identical to a polynucleotide sequence as set forth in
SEQ ID NO: 1. Such nucleic acids can function as new markers for certain types of
lymphocytes and for different cellular states for lymphocytes. For example, the invention
includes polynucleotide sequences that are at least 80%, 85%, 90%, 92%, 94%, 96%, 98%
or 100% identical to the polynucleotide sequences provided in the GenBank entries listed
in the tables. Identity is typically measured over at least 40, 50, 60, 70, 80, 90 or 100
contiguous nucleotides. In other instances, identity is measured over a region of at least
150, 200, or 250 nucleotides in length. In yet other instances, the region of similarity
exceeds 250 nucleotides in length and extends for at least 300, 350, 400, 450 or 500
nucleotides in length, or over the entire length of the sequence.

As described above, sequence identity comparisons can be conducted using a
nucleotide sequence comparison algorithm such as those known to those of skill in the art.
For example, one can use the BLASTN algorithm. Suitable parameters for use in
BLASTN are wordlength (W) of 11, M=5 and N=-4 and the identity values and region
sizes just described.
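
By way of illustration only, the way an identity threshold and a minimum region length
operate together can be sketched in Python as follows. This sketch does not implement
BLASTN or any other alignment algorithm; it assumes two already-aligned, ungapped regions
of equal length and simply counts matching positions. The function names
`percent_identity` and `meets_identity`, the default thresholds and the example sequences
are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of positions that match between two pre-aligned, ungapped regions."""
    if not a or len(a) != len(b):
        raise ValueError("regions must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def meets_identity(a: str, b: str, min_identity: float = 80.0, min_length: int = 40) -> bool:
    """True if the region is long enough and its identity reaches the threshold."""
    return len(a) >= min_length and percent_identity(a, b) >= min_identity


if __name__ == "__main__":
    reference = "ACGT" * 10            # hypothetical 40-nucleotide region
    variant = "TGCA" + reference[4:]   # same region with 4 mismatches
    print(percent_identity(reference, variant))
    print(meets_identity(reference, variant, min_identity=90.0))
```

In practice an alignment program would first locate the region over which the two
sequences are compared; only the thresholding step is shown here.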

B. Preparation of Differentially Expressed Genes 

The differentially expressed nucleic acids can be obtained by any suitable method
known in the art, including, for example: (1) hybridization of genomic or cDNA libraries
with probes to detect homologous nucleotide sequences; (2) antibody screening of
expression libraries to detect cloned DNA fragments with shared structural features; (3)
various amplification procedures such as polymerase chain reaction (PCR) using primers
capable of annealing to the nucleic acid of interest; and (4) direct chemical synthesis.

The desired nucleic acids can also be cloned using well-known amplification
techniques. Examples of protocols sufficient to direct persons of skill through in vitro
amplification methods, including the polymerase chain reaction (PCR), the ligase chain
reaction (LCR), Qβ-replicase amplification and other RNA polymerase mediated
techniques, are found in Berger, Sambrook, and Ausubel, as well as Mullis et al. (1987)
U.S. Patent No. 4,683,202; PCR Protocols: A Guide to Methods and Applications (Innis et
al. eds.) Academic Press Inc., San Diego, CA (1990) (Innis); Arnheim & Levinson (October
1, 1990) C&EN 36-47; The Journal Of NIH Research (1991) 3: 81-94; Kwoh et al.
(1989) Proc. Natl. Acad. Sci. USA 86: 1173; Guatelli et al. (1990) Proc. Natl. Acad. Sci.
USA 87: 1874; Lomell et al. (1989) J. Clin. Chem. 35: 1826; Landegren et al. (1988)
Science 241: 1077-1080; Van Brunt (1990) Biotechnology 8: 291-294; Wu and Wallace
(1989) Gene 4: 560; and Barringer et al. (1990) Gene 89: 117. Improved methods of
cloning in vitro amplified nucleic acids are described in Wallace et al., U.S. Pat. No.
5,426,039.

As an alternative to cloning a nucleic acid, a suitable nucleic acid can be
chemically synthesized. Direct chemical synthesis methods include, for example, the
phosphotriester method of Narang et al. (1979) Meth. Enzymol. 68: 90-99; the
phosphodiester method of Brown et al. (1979) Meth. Enzymol. 68: 109-151; the
diethylphosphoramidite method of Beaucage et al. (1981) Tetra. Lett., 22: 1859-1862; and
the solid support method described in U.S. Patent No. 4,458,066. Chemical synthesis
produces a single stranded polynucleotide. This can be converted into double stranded
DNA by hybridization with a complementary sequence, or by polymerization with a DNA
polymerase using the single strand as a template. While chemical synthesis of DNA is
often limited to sequences of about 100 bases, longer sequences can be obtained by the
ligation of shorter sequences. Alternatively, subsequences can be cloned and the
appropriate subsequences cleaved using appropriate restriction enzymes. The fragments
can then be ligated to produce the desired DNA sequence.

C. Utility of Differentially Expressed Nucleic Acids and Expression Profiles 

As alluded to above and described in greater detail below, the differentially
expressed nucleic acids that are provided can be used as markers in a variety of screening
and diagnostic methods. For example, the differentially expressed nucleic acids find utility
as hybridization probes or amplification primers. In certain instances, these probes and
primers are fragments of the differentially expressed nucleic acids of the lengths described
earlier in this section. Such fragments are generally of sufficient length to specifically
hybridize to an RNA or DNA in a sample obtained from a subject. The nucleic acids are
typically 10-30 nucleotides in length, although they can be longer as described above. The
probes can be used in a variety of different types of hybridization experiments, including,
but not limited to, Northern blots and Southern blots and in the preparation of custom
arrays (see infra). The differentially expressed nucleic acids can also be used in the design
of primers for amplifying the differentially expressed nucleic acids and in the design of
primers and probes for quantitative RT-PCR. The primers most frequently include about
20 to 30 contiguous nucleotides of the differentially expressed nucleic acids to obtain the
desired level of stability and thus selectivity in amplification, although longer sequences as
described above can also be utilized.

Hybridization conditions are varied according to the particular application. For
applications requiring high selectivity (e.g., amplification of a particular sequence),
relatively stringent conditions are utilized, such as about 0.02 M to about 0.10 M NaCl at
temperatures of about 50 °C to about 70 °C. High stringency conditions such as these
tolerate little, if any, mismatch between the probe and the template or target strand of the
differentially expressed nucleic acid. Such conditions are useful for isolating specific
genes or detecting particular mRNA transcripts, for example.

Other applications, such as substitution of amino acids by site-directed
mutagenesis, require less stringency. Under these conditions, hybridization can occur even
though the sequences of the probe and target nucleic acid are not perfectly complementary,
but instead include one or more mismatches. Conditions can be rendered less stringent by
increasing the salt concentration and decreasing temperature. For example, a medium
stringency condition includes about 0.1 to 0.25 M NaCl at temperatures of about 37 °C to
about 55 °C. Low stringency conditions include about 0.15 M to about 0.9 M salt, at
temperatures ranging from about 20 °C to about 55 °C.
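
By way of illustration only, the approximate stringency ranges recited above can be
tabulated and queried as in the Python sketch below. Because the recited ranges overlap, a
given set of conditions can fall within more than one category, and the sketch therefore
returns every matching category rather than a single label. The names `STRINGENCY_RANGES`
and `matching_stringency`, and the treatment of the "about" endpoints as exact inclusive
bounds, are assumptions made for the example.

```python
# (NaCl molarity range, temperature range in degrees C) for each recited category.
STRINGENCY_RANGES = {
    "high": ((0.02, 0.10), (50.0, 70.0)),
    "medium": ((0.10, 0.25), (37.0, 55.0)),
    "low": ((0.15, 0.90), (20.0, 55.0)),
}


def matching_stringency(nacl_molar: float, temp_c: float) -> list:
    """Return every stringency category whose salt and temperature ranges both apply."""
    return [
        name
        for name, ((salt_lo, salt_hi), (temp_lo, temp_hi)) in STRINGENCY_RANGES.items()
        if salt_lo <= nacl_molar <= salt_hi and temp_lo <= temp_c <= temp_hi
    ]


if __name__ == "__main__":
    for salt, temp in ((0.05, 65.0), (0.20, 42.0), (0.50, 30.0), (1.50, 30.0)):
        print(f"{salt:.2f} M NaCl at {temp:.0f} C -> {matching_stringency(salt, temp)}")
```

Lower salt and higher temperature both disfavor mismatched duplexes, which is why the
high stringency range sits at the low-salt, high-temperature end of the table.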



V. Proteins 

A. General 
The differentially expressed nucleic acids that have been identified can be
inserted into any of a number of known expression systems to generate large amounts of
the protein encoded by the gene or gene fragment. Such proteins can then be utilized in the
preparation of antibodies. Proteins encoded by target genes can be utilized in the
compound development programs described below and in the preparation of various
diagnostics (e.g., antibody arrays).

The polypeptides can be isolated from natural sources, and/or prepared according
to recombinant methods, and/or prepared by chemical synthesis, and/or prepared using a
combination of recombinant methods and chemical synthesis. Besides substantially
full-length polypeptides, biologically active fragments of the polypeptides are also provided.
Biological activity can include, for example, antibody binding (e.g., the fragment competes
with a full-length polypeptide) and immunogenicity (i.e., possession of epitopes that
stimulate B- or T-cell responses against the fragment). Such fragments generally comprise
at least 5 contiguous amino acids, typically at least 6 or 7 contiguous amino acids, in other
instances 8 or 9 contiguous amino acids, usually at least 10, 11 or 12 contiguous amino
acids, in still other instances at least 13 or 14 contiguous amino acids, in yet other instances
at least 16 contiguous amino acids, and in some cases at least 20, 40, 60 or 80 contiguous
amino acids.

Often the polypeptides will share at least one antigenic determinant in common
with the amino acid sequence of the full-length polypeptide. The existence of such a
common determinant is evidenced by cross-reactivity of the variant protein with any
antibody prepared against the full-length polypeptide. Cross-reactivity can be tested using
polyclonal sera against the full-length polypeptide, but can also be tested using one or
more monoclonal antibodies against the full-length polypeptide.

The polypeptides include conservative variations of the naturally occurring
polypeptides. Such variations can be minor sequence variations of the polypeptide that
arise due to natural variation within the population (e.g., single nucleotide polymorphisms)
or they can be homologs found in other species. They also can be sequences that do not
occur naturally but that are sufficiently similar so that they function similarly and/or elicit
an immune response that cross-reacts with natural forms of the polypeptide. Sequence
variants can be prepared by standard site-directed mutagenesis techniques. The
polypeptide variants can be substitutional, insertional or deletion variants. Deletion
variants lack one or more residues of the native protein that are not essential for function or
immunogenic activity (e.g., polypeptides lacking transmembrane or secretory signal
sequences). Substitutional variants involve conservative substitutions of one amino acid
residue for another at one or more sites within the protein and can be designed to modulate
one or more properties of the polypeptide such as stability against proteolytic cleavage.
Insertional variants include, for example, fusion proteins such as those used to allow rapid
purification of the polypeptide and also can include hybrid proteins containing sequences
from other polypeptides, which are homologues of the polypeptide. The foregoing
variations can be utilized to create an equivalent, or even an improved, second-generation
polypeptide. Preparation of variants is well known in the art (see, e.g., Creighton (1984)
Proteins, W.H. Freeman and Company, which is incorporated herein by reference in its
entirety for all purposes).

The polypeptides that are provided also include those in which the polypeptide
has a modified polypeptide backbone. Examples of such modifications include chemical
derivatizations of polypeptides, such as acetylations and carboxylations. Modifications
also include glycosylation modifications and processing variants of a typical polypeptide.
Such processing steps specifically include enzymatic modifications, such as
ubiquitinization and phosphorylation. See, e.g., Hershko & Ciechanover, Ann. Rev.
Biochem. 51:335-364 (1982). Also included are mimetics, which are peptide-containing
molecules that mimic elements of protein secondary structure (see, e.g., Johnson, et al.,
"Peptide Turn Mimetics" in Biotechnology and Pharmacy, (Pezzuto et al., Eds.), Chapman
and Hall, New York (1993)). Peptide mimetics are typically designed so that side chain
groups extending from the backbone are oriented such that the side chains of the mimetic
can be involved in molecular interactions similar to the interactions of the side chains in
the native protein.

B. Production of Polypeptides 

1. Recombinant Technologies 

The polypeptides encoded by the differentially expressed nucleic acids can be
expressed in hosts after the coding sequences have been operably linked to an expression
control sequence in an expression vector. Expression vectors are typically replicable in the
host organisms either as episomes or as an integral part of the host chromosomal DNA.
Expression vectors commonly contain selection markers, e.g., tetracycline resistance or
hygromycin resistance, to permit detection and/or selection of those cells transformed with
the desired DNA sequences (see, e.g., U.S. Patent 4,704,362).

A differentially expressed gene typically is placed under the control of a promoter
that is functional in the desired host cell to produce relatively large quantities of a
polypeptide of the invention. An extremely wide variety of promoters are well known to
those of skill, and can be used in the expression vectors, depending on the particular
application. Ordinarily, the promoter selected depends upon the cell in which the promoter
is to be active. Other expression control sequences such as ribosome binding sites,
transcription termination sites and the like are also optionally included. Constructs that
include one or more of such control sequences are termed "expression cassettes."
Accordingly, expression cassettes are provided into which the differentially expressed
nucleic acids are incorporated for high level expression of the corresponding protein in a
desired host cell.

A wide variety of cloning and in vitro amplification methods suitable for the
construction of recombinant nucleic acids is described, for example, in Berger and
Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology, Volume 152,
Academic Press, Inc., San Diego, CA (Berger); and "Current Protocols in Molecular
Biology," F.M. Ausubel et al., eds., Current Protocols, a joint venture between Greene
Publishing Associates, Inc. and John Wiley & Sons, Inc., (1998 Supplement) (Ausubel).

2. Naturally Occurring Polypeptides

Naturally occurring polypeptides encoded by the differentially expressed genes
can also be isolated using conventional techniques such as affinity chromatography. For
example, polyclonal or monoclonal antibodies can be raised against the polypeptide of
interest and attached to a suitable affinity column by well-known techniques. See, e.g.,
Hudson & Hay, Practical Immunology (Blackwell Scientific Publications, Oxford, UK,
1980), Chapter 8 (incorporated by reference in its entirety). Peptide fragments can be
generated from intact polypeptides by chemical or enzymatic cleavage methods known to
those of skill in the art.

3. Other Methods

Alternatively, the polypeptides encoded by differentially expressed genes or gene
fragments can be synthesized by chemical methods or produced by in vitro translation
systems using a polynucleotide template to direct translation. Methods for chemical
synthesis of polypeptides and in vitro translation are well-known in the art, and are
described further by Berger & Kimmel, Methods in Enzymology, Volume 152, Guide to
Molecular Cloning Techniques, Academic Press, Inc., San Diego, CA, 1987 (incorporated
by reference in its entirety).

C. Utility 

The polypeptides can be used to generate antibodies that specifically bind to
epitopes associated with the polypeptides or fragments thereof. Commercially available
computer sequence analysis can be used to determine the location of the predicted major
antigenic determinant epitopes of the polypeptide (e.g., MacVector from IBI, New Haven,
Conn.). Once such an analysis has been performed, polypeptides can be prepared that
contain at least the essential structural features of the antigenic determinant and can be
utilized in the production of antisera against the polypeptide. Minigenes or gene fusions
encoding these determinants can be constructed and inserted into expression vectors such
as those described above using standard techniques. The major antigenic determinants can
also be determined empirically, in which case portions of the gene encoding the polypeptide
are expressed in a recombinant host, and the resulting proteins are tested for their ability to
elicit an immune response. For example, PCR can be used to prepare a range of cDNAs
encoding polypeptides lacking successively longer fragments of the C-terminus of the
polypeptide. The immunoprotective activity of each of these polypeptides then identifies
those fragments or domains of the polypeptide that are essential for this activity. Further
experiments in which only a small number of amino acids are removed at each iteration
then allow the antigenic determinants of the polypeptide to be located.

Polypeptides encoded by target genes can be utilized in the development of
pharmaceutical compositions, for example compositions that modulate gene products
associated with cancerous cells. The process for identifying such polypeptides and
subsequent compound development is described further below.

VI. Exemplary Screening, Diagnostic and Classification Methods
A. General Considerations 

Certain methods that are provided involve comparing the expression level of
one or more of the differentially expressed genes in a test cell population with the
expression level of the same genes in a control cell population, or comparing the
expression profile for one sample with an expression profile determined for another
sample. The level of expression of the differentially expressed nucleic acids can be
determined at either the nucleic acid level or the protein level. Thus, the phrases
"determining the expression level," "preparing a gene expression profile," and other like
phrases when used in reference to the differentially expressed nucleic acids mean that
transcript levels and/or levels of protein encoded by the differentially expressed nucleic
acids are detected. When determining the level of expression, the level can be determined
qualitatively, but generally is determined quantitatively.

Based upon the sequence information that is disclosed herein, coupled with the
nucleic acid and protein detection methods that are described herein and that are known in
the art, expression levels of these genes can readily be determined. If transcript levels are
determined, they can be determined using routine methods. For instance, the sequence
information provided herein (e.g., GenBank sequence entries) can be used to construct
nucleic acid probes for use in conventional hybridization detection
methods (e.g., Northern blots). Alternatively, the provided sequence information can be
used to generate primers that in turn are used to amplify and detect differentially expressed
nucleic acids that are present in a sample (e.g., quantitative RT-PCR methods). If instead
expression is detected at the protein level, encoded protein can be detected and optionally
quantified using any of a number of established techniques. One common approach is to
use antibodies that specifically bind to the protein product in immunoassay methods.
Additional details regarding methods of detecting differential gene expression are
provided infra.



WO 2004/083366 PCT/FI2004/000155 


Expression levels can be detected for one, some, or all of the differentially
expressed nucleic acids that are listed in one or more of the tables (e.g., Table 1, or Table
2, or Table 6). With some methods, the expression levels for only 1, 2, 3, 4 or 5
differentially expressed nucleic acids are determined. In other methods, expression levels
for at least 6, 7, 8, 9 or 10 differentially expressed nucleic acids are determined. In still
other methods, expression levels for at least 15, 20, 25, 30, 35, 40, 45, 50, 55, or 60
differentially expressed nucleic acids are determined. In yet other methods, expression
levels for all of the differentially expressed genes in one or more of the tables are determined.

Determination of expression levels is typically done with a test sample taken
from a test cell population. As used herein, the term "population" when used in reference
to a cell can mean a single cell, but typically refers to a plurality of cells (e.g., a tissue
sample). Certain screening methods are performed with test cells that are "capable of
expressing" one or more of the differentially expressed nucleic acids. As used in this
context, the phrase "capable of expressing" means that the gene of interest is in intact form
and can be expressed within the cell.

A number of the methods that are provided involve a comparison of expression
levels for certain differentially expressed nucleic acids in a "test cell" with the expression
levels for the same nucleic acids in a "control cell" (also sometimes referred to as a
"control sample," a "reference cell," a "reference value," or simply a "control"). Other
methods involve a comparison between one expression profile and a baseline expression
profile. In either case, the expression level for the control cell or baseline expression
profile essentially establishes a baseline against which an experimental value is compared.
The comparison of expression levels is meant to be interpreted broadly with respect to:
1) what is meant by the term "cell", 2) the time at which the expression levels for test and
control cells are determined, and 3) the measure of the expression levels.

So, for example, although the terms "test cell" and "control cell" are used for
convenience, the term "cell" is meant to be construed broadly. A cell, for instance, can
also refer to a population of cells (e.g., a tissue sample), just as a population of cells can
have a single member. The cell may in some instances be a sample that is derived from a
cell (e.g., a cell lysate, a homogenate, or a cell fraction). In general, samples can be
obtained from various sources, particularly sources of lymphocytes.

With respect to timing, comparison of expression levels can be done
contemporaneously (e.g., a test and control cell are each contacted with a test agent in
parallel reactions). The comparison alternatively can be conducted with expression levels
that have been determined at temporally distinct times. As an example, expression levels
for the control cell can be collected prior to the expression levels for the test cell and stored
for future use (e.g., expression levels stored on a computer compatible storage medium).

The expression level for a control cell or baseline expression profile (e.g.,
baseline value) can be a value for a single cell or it can be an average, mean or other
statistical value determined for a plurality of cells. As an example, the expression level for
a control cell can be the average of the expression levels for a population of subjects (e.g.,
subjects not having an immune-related disorder such as asthma). In other instances, the
value for each expression level for the control cell is a range of values representative of the
range observed for a particular population. Expression level values can also be either
qualitative or quantitative. The values for expression levels can also optionally be
normalized with respect to the expression level of a nucleic acid that is not one of the
markers under analysis.
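By way of illustration only, the normalization and baseline calculation described above can be sketched as follows. This is a minimal sketch that assumes expression values are positive numbers and that a single non-marker reference nucleic acid is used for normalization; the function names `normalize` and `baseline` are hypothetical and are not part of the disclosure.

```python
from statistics import mean


def normalize(marker_level: float, reference_level: float) -> float:
    """Express a marker level relative to a non-marker reference nucleic acid."""
    if reference_level <= 0:
        raise ValueError("reference level must be positive")
    return marker_level / reference_level


def baseline(control_levels: list[float]) -> dict[str, float]:
    """Summarize control measurements as a mean and as the observed range."""
    if not control_levels:
        raise ValueError("at least one control measurement is required")
    return {
        "mean": mean(control_levels),
        "low": min(control_levels),
        "high": max(control_levels),
    }
```

Either the mean or the range returned by `baseline` could then serve as the stored control value against which a later test measurement is compared.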

The comparative analysis required in some methods involves determining
whether the expression level values are "comparable" (or "similar"), or "differ" from one
another. In some instances, the expression levels for a particular marker in test and control
cells are considered similar if they differ from one another by no more than the level of
experimental error. Often, however, expression levels are considered similar if the level in
the test cell differs by less than 5%, 10%, 20%, 50%, 100%, 150%, or 200% with respect
to the control cell. It thus follows that in some instances the expression level for a
particular marker in the test cell is considered to differ from the expression level for the
same marker in the control cell if the difference is greater than the level of experimental
error, or if it is greater than 5%, 10%, 20%, 50%, 100%, 150% or 200%. In some methods,
the comparison involves a determination of whether there is a "statistically significant
difference" in the expression level for a marker in the test and control cells. A difference is
generally considered to be "statistically significant" if the probability of the observed
difference occurring by chance (the p-value) is less than some predetermined level. As
used herein a "statistically significant difference" refers to a p-value that is < 0.05,
preferably < 0.01 and most preferably < 0.001. If gene expression is increased sufficiently
such that it is different (as just defined) relative to the control cell or baseline, the
expression of that gene is considered "up-regulated" or "increased." If, instead, gene
expression is decreased so it differs from the control cell or baseline value, the expression
of that gene is "down-regulated" or "decreased."
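The single-marker comparison just described can be sketched as follows, for illustration only. The sketch assumes quantitative, positive expression values; the function names, the default 50% threshold, and the choice to treat a difference exactly equal to the threshold as "different" are assumptions made for the example.

```python
def classify_change(test: float, control: float, threshold_pct: float = 50.0) -> str:
    """Label a test level as similar, up-regulated or down-regulated vs. control."""
    if control <= 0:
        raise ValueError("control level must be positive")
    # Percent difference of the test level with respect to the control level.
    diff_pct = (test - control) / control * 100.0
    if abs(diff_pct) < threshold_pct:
        return "similar"
    return "up-regulated" if diff_pct > 0 else "down-regulated"


def is_significant(p_value: float, alpha: float = 0.05) -> bool:
    """Apply the p-value criterion; alpha may be 0.05, 0.01 or 0.001."""
    return p_value < alpha
```

For example, with the default threshold a test level of 300 against a control level of 100 would be labeled "up-regulated", whereas a test level of 120 would be labeled "similar".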




Comparison of the expression levels between test and control cells can involve
comparing levels for a single marker or a plurality of markers (e.g., when expression
profiles are compared). When the expression level for a single marker is determined,
whether expression levels between the test and control cell are similar or different involves
a comparison of the expression level of the single marker. When, however, expression
levels for multiple markers are compared, the comparison analysis can involve two
analyses: 1) a determination for each marker examined whether the expression level is
similar between the test and control cells, and 2) a determination of how many markers
from the group of markers examined show similar or different expression levels. The first
determination is done as just described. The second determination typically involves
determining whether at least 50% of the markers examined show similarity in expression
levels. However, in methods where more stringent correlations are required, at least 60%,
70%, 80%, 90%, 95% or 100% of the markers must show similar expression levels for the
expression levels of the group of markers examined to be considered similar between the
test and control cells.
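The two-stage comparison for multiple markers can be sketched as follows, for illustration only. The sketch assumes each profile is a mapping from marker name to a positive expression value; the function name `profiles_similar`, the marker names used in the usage note, and the default thresholds are hypothetical.

```python
def profiles_similar(
    test: dict[str, float],
    control: dict[str, float],
    per_marker_pct: float = 50.0,
    min_fraction: float = 0.5,
) -> bool:
    """Return True if enough shared markers show similar expression levels."""
    shared = test.keys() & control.keys()
    if not shared:
        raise ValueError("profiles share no markers")
    similar = 0
    for marker in shared:
        ctrl = control[marker]
        # Stage 1: per-marker similarity by percent difference from control.
        if ctrl > 0 and abs(test[marker] - ctrl) / ctrl * 100.0 < per_marker_pct:
            similar += 1
    # Stage 2: fraction of markers that were judged similar.
    return similar / len(shared) >= min_fraction
```

With three hypothetical markers of which two are similar, the profiles would be considered similar under the 50% criterion but not under a more stringent 90% criterion (`min_fraction=0.9`).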

B. Screening Methods 

1. Exemplary Approaches

Monitoring changes in gene expression can provide certain advantages during
drug screening and development. Often drugs are pre-screened for the ability to interact
with a major target without regard to other effects the drugs have on cells. Often such
other effects cause toxicity in the whole animal, which prevents the development and use of
the potential drug. These global changes in gene expression provide useful markers for
diagnostic uses as well as markers that can be used to monitor disease states, disease
progression, and drug metabolism. Thus, these expression profiles of genes provide
molecular tools for evaluating drug toxicity, drug efficacy, and disease monitoring.

Changes in the expression profile from a baseline profile (e.g., the data in Table
1) can be used as an indication of such effects. Those skilled in the art can use any of a
variety of known techniques to evaluate the expression of one or more of the genes and/or
gene fragments identified in the present application in order to observe changes in the
expression profile in a cell or sample of interest. Comparison of the expression data, as
well as available sequence or other information, may be done by a researcher or diagnostician
or may be done with the aid of a computer and databases.

In some screening methods, compounds and molecules are screened to identify
those that affect expression of a target gene or some other gene involved in regulating the
expression of a target gene (e.g., by interacting with the regulatory region or transcription
factors of a target gene). Compounds are also screened to identify those that affect the
activity of such proteins (e.g., by inhibiting target gene activity) or the activity of a
molecule involved in the regulation of a target gene.

So, for example, in some methods potential drug compounds are screened to
determine if application of the compound alters the expression of one or more of the target
genes identified herein. This may be useful, for example, in determining whether a
particular compound is effective in treating asthma or other immune-mediated disease. In
the case in which the expression of a gene during the CD4+ lymphocyte polarization is
affected by the potential drug compound, the compound is indicated in the treatment of
asthma or other immune-mediated disease. Similarly, a drug compound which causes
expression of a gene which is normally down-regulated during the CD4+ lymphocyte
polarization may be indicated in the treatment of the same diseases.

According to the present invention, the target genes listed in Table 2 or Table 6
may also be used as markers to evaluate the effects of a candidate drug or agent on a
lymphocyte cell, particularly one undergoing polarization. A candidate drug or agent can be
screened for the ability to stimulate the transcription or expression of a given marker or
markers (drug targets) or to down-regulate or inhibit the transcription or expression of a
marker or markers. According to the present invention, one can also compare the
specificity of a drug's effects by looking at the number of markers affected by the drug and
comparing it to the number of markers affected by a different drug. A more specific
drug will affect fewer transcriptional targets. Similar sets of markers identified for two
drugs indicate a similarity in effect.
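For illustration only, the specificity comparison described above can be sketched as a count of affected markers per drug and an overlap score between the two affected sets. The sketch assumes a fold change defined as the ratio of treated to untreated expression; the two-fold cutoff, the use of the Jaccard index as the overlap score, and all names are assumptions made for the example rather than features of the disclosed methods.

```python
def affected_markers(fold_changes: dict[str, float], min_fold: float = 2.0) -> set[str]:
    """Markers whose expression changes at least min_fold in either direction."""
    return {
        gene
        for gene, fc in fold_changes.items()
        if fc >= min_fold or fc <= 1.0 / min_fold
    }


def overlap(a: set[str], b: set[str]) -> float:
    """Jaccard index of two marker sets (1.0 means identical sets)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0
```

A drug with the smaller affected set would be regarded as the more specific of the two, and an overlap close to 1.0 would suggest a similarity in effect.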

Some methods are designed for identifying agents that modulate the levels,
concentration or at least one activity of a protein(s) encoded by one or several genes in
Table 2 or Table 6. Such methods or assays may utilize any means of monitoring or
detecting the desired activity.

One specific embodiment of the invention is a method of identifying a compound
capable of modulating the polarization of CD4+ lymphocytes, the method comprising:

(a) contacting the compound with naive CD4+ lymphocytes;

(b) inducing the polarization of the lymphocytes;

(c) preparing a gene expression profile from the lymphocytes;

(d) comparing the lymphocyte gene expression profile to a gene expression profile
derived from Table 1.

Preferably the induction of step (b) is performed by contacting the lymphocytes
with a cytokine. Preferably the cytokine is IL-12 or IL-4. A difference in the expression
profiles of the target genes identifies a potential drug compound for the treatment of
asthma or other immune-mediated diseases. Another preferred embodiment of the
invention is an identification method of the invention, wherein said gene expression profile
derived from Table 1 is at least partly based on the expression fold changes of any one of
the genes selected from the group consisting of: KIAA0053, LRRN3, CIG5, DUSP6,
FER1L3, S100P, SATB1, SLC11A2, STK17B, a gene identified by accession number
AI971169 and a gene identified by accession number AL432401.
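Step (d) of the method above can be sketched, for illustration only, as a comparison of observed fold changes with reference fold changes. The sketch assumes that both profiles are expressed as positive fold-change ratios and that a gap of at least one log2 unit counts as a difference; the function name, the cutoff, and the gene labels in the usage note are placeholders and are not values taken from Table 1.

```python
import math


def deviating_genes(
    observed: dict[str, float],
    reference: dict[str, float],
    min_log2_gap: float = 1.0,
) -> list[str]:
    """Genes whose observed fold change departs from the reference fold change."""
    deviating = []
    for gene, ref_fc in reference.items():
        if gene not in observed:
            continue  # gene not measured in the test profile
        obs_fc = observed[gene]
        if obs_fc <= 0 or ref_fc <= 0:
            raise ValueError("fold changes must be positive ratios")
        # Compare on a log2 scale so that 2-fold up and 2-fold down are symmetric.
        if abs(math.log2(obs_fc) - math.log2(ref_fc)) >= min_log2_gap:
            deviating.append(gene)
    return sorted(deviating)
```

For a hypothetical reference of `{"GENE_A": 4.0, "GENE_B": 0.25}` and an observed profile of `{"GENE_A": 1.0, "GENE_B": 0.3}`, only `GENE_A` would be reported, suggesting that the compound interfered with the change normally seen for that gene during polarization.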

Assays and screens can be used to identify compounds that are effective
activators or inhibitors of target gene expression or activity. The assays and screens can be
done by physical selection of molecules from libraries, and computer comparisons of
digital models of compounds in molecular libraries and a digital model of the active site of
the target gene product (i.e., protein).

The activators or inhibitors identified in the assays and screens may act by, without
limitation, binding to a target gene product, binding to intracellular proteins that
bind to a target gene product, interfering with the interaction between a
target gene product and its substrates, modulating the activity of a target
gene, or modulating the expression of a target gene or a target gene product.

Assays can also be used to identify molecules that bind to target gene regulatory
sequences (e.g., promoter sequences), thus modulating gene expression. See, e.g., Platt
(1994), J. Biol. Chem., 269:28558-28562.

Another specific embodiment of the invention is a method of identifying a
compound that modulates the expression of at least one gene listed in Table 2 or Table 6.
These methods involve:

(a) incubating a cell that can express a protein from the gene with a compound
under conditions and for a time sufficient for the cell to express the protein of
said gene, when the compound is not present;

(b) incubating a control cell under the same conditions and for the same time
without the compound;

(c) measuring expression of the gene in the cell in the presence of the compound;

(d) measuring expression of the gene in the control cell; and

(e) comparing the amount of expression of the gene in the presence and absence
of the compound, wherein a difference in the level of expression indicates that
the compound modulates the expression of the gene.

Another specific embodiment of the invention is a method of identifying a
compound that modulates the activity of at least one gene listed in Table 2 or Table 6, the
method comprising:

(a) incubating a cell that has said activity with a compound under conditions and
for a time sufficient for the cell to express said activity, when the compound is
not present;

(b) incubating a control cell under the same conditions and for the same time
without the compound;

(c) measuring said activity in the cell in the presence of the compound;

(d) measuring said activity in the control cell; and

(e) comparing the amount of said activity in the presence and absence of the
compound, wherein a difference in the level of expression indicates that the
compound modulates the activity of said gene.

In one preferred embodiment, the gene or genes is/are selected from the group
consisting of: KIAA0053, LRRN3, CIG5, DUSP6, FER1L3, S100P, SATB1, SLC11A2,
STK17B, a gene identified by accession number AI971169 and a gene identified by
accession number AL432401.

Certain screening methods are performed with mouse lymphocytes. As an
example, some methods involve identifying a compound capable of modulating the
polarization of murine CD4+ lymphocytes. These methods comprise:

(a) contacting the compound with naive murine CD4+ lymphocytes;

(b) inducing the polarization of the lymphocytes;

(c) preparing a gene expression profile from the lymphocytes; and

(d) comparing the lymphocyte gene expression profile to a gene expression profile
derived from Table 3.

This method can also be used for determining or confirming the activity of a
compound in murine cells, when said compound is already identified in any one of the
above-mentioned human lymphocyte assay based identification methods as a modulator for
polarization of human lymphocytes.



2. Methods for Detecting Differential Gene Expression

Assays to monitor the expression of a marker or markers as defined in Table 2 or
Table 6 may utilize any available means of monitoring for changes in the expression level
of the target genes. As used herein, an agent is said to modulate the expression of a target
gene if it is capable of up- or down-regulating expression of the target gene in a
lymphocyte cell during polarization. The protein products encoded by the genes identified
herein can also be assayed to determine the amount of expression. Any method for
specifically and quantitatively measuring a specific protein or mRNA or DNA product can
be used. However, methods and assays of the invention typically utilize PCR or array or
chip hybridization-based methods when seeking to detect the expression of a large number
of genes.

The genes identified as being differentially expressed in lymphocytes may be
used in a variety of nucleic acid detection assays to detect or quantify the expression level
of a gene or multiple genes in a given sample. For example, traditional Northern blotting,
dot blots, nuclease protection, RT-PCR, differential display methods, subtractive
hybridization, and in situ hybridization may be used for detecting gene expression levels.
Levels of mRNA expression may be monitored directly by hybridization of probes to the
nucleic acids of the invention. Cell lines are exposed to an agent to be tested under
appropriate conditions and time and total RNA or mRNA is isolated by standard
procedures such as those disclosed in Sambrook et al., Molecular Cloning - A Laboratory
Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1989).

One of skill in the art will appreciate that an enormous number of array designs
are suitable for the practice of this invention. The high density array will typically include
a number of probes that specifically hybridize to the sequences of interest. See WO
99/32660 for methods of producing probes for a given gene or genes. In addition, in a
preferred embodiment, the array will include one or more control probes.

3. Exemplary Candidate Agents 

Agents that are assayed in the above methods can be randomly selected or
rationally selected or designed. As used herein, an agent is said to be randomly selected
when the agent is chosen randomly without considering the specific sequences of the
protein itself or those sequences involved in the interaction of the protein with its
substrates or ligands, for instance. An example of randomly selected agents is a chemical
library or a peptide combinatorial library, or a growth broth of an organism. As used
herein, an agent is said to be rationally selected or designed when the agent is chosen on a
nonrandom basis, taking into account the sequence of the target site and/or its
conformation in connection with the agent's action. Agents can be selected or designed by
utilizing the peptide sequences that make up these sites. For example, a rationally selected
peptide agent can be a peptide whose amino acid sequence is identical to or a derivative of
any functional consensus site.

The agents of the present invention can be, as examples, peptides, small chemical
molecules, vitamin derivatives, as well as carbohydrates, lipids, oligonucleotides and
covalent and non-covalent combinations thereof. Dominant negative proteins, DNA
encoding these proteins, antibodies to these proteins, peptide fragments of these proteins or
mimics of these proteins may be introduced into cells. A "mimetic" as used herein refers
to a protein having a modification of a region or several regions such that the protein
mimetic has a structure chemically different from the parent peptide but topographically
and functionally similar to the parent peptide. A skilled artisan can readily recognize that
there is no limit as to the structural nature of the agents of the present invention.

Some compounds are peptides, including but not limited to, soluble peptides,
including but not limited to members of random peptide libraries (see, e.g., Lam et al.
(1991) Nature 354:82-84; Houghten et al. (1991) Nature 354:84-86), and combinatorial
chemistry-derived molecular libraries made of D- and/or L-amino acids, phosphopeptides
(including, but not limited to, members of random or partially degenerate, directed
phosphopeptide libraries; see, e.g., Songyang et al. (1993) Cell 72:767-778), and small
organic or inorganic molecules.

4. Computerized Analysis

Computer modeling or searching technologies can be used to identify
compounds, or identify modified compounds that modulate or are candidates to modulate
the expression or activity of a target gene product. For example, compounds likely to
interact with the active site of the target gene product may be identified. The active site of
a target gene product can be identified using methods known in the art including, for
example, analysis of the amino acid sequence of a molecule, and from a study of
complexes formed by a target gene product with a native ligand. Chemical or X-ray
crystallographic methods can be used to identify the active site of a target gene product
through the location of a bound ligand.

The three-dimensional structure of the active site can be determined. This can be
done using known methods, including X-ray crystallography, which can be used to
determine a complete molecular structure. Solid or liquid phase NMR can be used to
determine certain intramolecular distances. Other methods of structural analysis can be
used to determine partial or complete geometrical structures.

Computer-based numerical modeling can be used to complete an incomplete or
insufficiently accurate structure. Modeling methods that can be used are, for example,
parameterized models specific to particular biopolymers such as proteins or nucleic acids,
molecular dynamics models based on computing molecular motions, statistical mechanics
models based on thermal ensembles, or combined models. For most types of models,
standard molecular force fields, representing the forces between constituent atoms and
groups, are necessary, and can be selected from force fields known in physical chemistry.
Information on incomplete or less accurate structures determined in this way can be
incorporated as constraints on the structures computed by these modeling methods.

Having determined the structure of the active site of a target gene product, either
experimentally, by modeling, or by a combination of methods, additional candidate
modulating compounds can be identified by searching databases containing compounds
along with information on their molecular structure. The compounds identified in such a
search are those that have structures that match the active site structure, fit into the active
site, or interact with groups defining the active site. The compounds identified by the
search are potential target gene product modulating compounds.

These methods may also be used to identify improved modulating compounds
from already known modulating compounds or ligands. The structure of the known
compound is modified and effects are determined using experimental and computer
modeling methods. The altered structure is compared to the active site structure of a target
gene product to determine or predict how a particular modification to the ligand or
modulating compound will affect its interaction with that protein. Systematic variations in
composition, such as by varying side groups, can be evaluated to obtain modified
modulating compounds or ligands of preferred specificity or activity.

Given the teachings herein, additional experimental and computer modeling
methods useful to identify modulating compounds based on identification of the active
sites of a target gene product and related transduction and transcription factors can be
developed by those skilled in the art. Computer programs designed to screen and depict
chemicals as well as molecular modeling systems are available from companies such as
MSI (Molecular Simulations, Inc., San Diego, CA, USA), Allelix, Inc. (Mississauga,
Ontario, Canada), and Hypercube, Inc. (Gainesville, FL, USA).

In addition to designing and generating compounds that alter binding, as
described above, libraries of known compounds, including natural products, synthetic
chemicals, and biologically active materials including peptides, can be screened for
compounds that are inhibitors or activators.

Compounds identified by methods described above may be useful, for example,
for elaborating the biological function of target gene products and in treatment of disorders
in which target gene activity is deleterious.

C. Diagnostic Methods

Methods for assessing whether a subject has or is predisposed to develop an
immune-mediated disease (e.g., asthma, allergy or autoimmune disease) are also provided.
These methods generally involve obtaining a sample from a subject having or suspected to
have an immune-mediated disease. The expression levels for one or more of the differentially
expressed genes are then determined for the sample. The population of test cells is selected
to include lymphocytic cells from the subject.

The expression level of the gene(s) is then compared with the expression level of
the same gene(s) in a control sample. The status of the control sample with respect to
presence or absence of an immune-mediated disease is known (e.g., the control sample is
from an individual not suffering from the immune-related disease of interest or is from an
individual having an immune-related disease). So, for example, if the control cell is
representative of cells from a healthy individual, then similarity in expression level or
expression profile between the test and control samples indicates that the subject does not
have an immune-related disease. A difference in expression level or profile, in contrast,
indicates that the subject from whom the test sample was derived has an immune-related
disease.

If instead, the control sample is representative of cells from an individual that has
a particular immune-related disease, then similarity in expression levels or expression
profile means that the test cells are from a patient that has, or is susceptible to, the
immune-related disease; whereas, a difference in expression levels or profile indicates that
the subject does not have the immune-related disease.
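The interpretation rule set out in the two preceding paragraphs can be summarized, for illustration only, by the following minimal sketch. It assumes that the similar/different determination has already been made by one of the comparisons described in the General Considerations; the function name and the returned labels are hypothetical.

```python
def interpret(similar_to_control: bool, control_has_disease: bool) -> str:
    """Map a similar/different result onto an indication, given the control status."""
    if control_has_disease:
        # Resembling a diseased control indicates disease; differing does not.
        return "disease indicated" if similar_to_control else "disease not indicated"
    # Resembling a healthy control indicates no disease; differing indicates disease.
    return "disease not indicated" if similar_to_control else "disease indicated"
```

The sketch makes explicit that the same similarity result leads to opposite conclusions depending on the known status of the control sample.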

D. Classifying Lymphocytes

Other methods that are provided are designed to classify a lymphocyte or to
assess its cellular state. Such methods generally involve obtaining a test sample derived
from a lymphocyte that is capable of expressing one or more nucleic acid markers from the
group consisting of those listed in one or more of the tables (e.g., Table 1, or Table 2 or
Table 6). The expression level for one or more of these markers is then determined. The
expression level for these markers is compared with the expression levels of the same
markers in a control sample. The control sample is derived from a lymphocytic cell whose
cellular status is known (e.g., the cell is known to be a Th1 or Th2 cell). The lymphocytic
cell from which the test sample is derived is then classified on the basis of this comparison.

For example, if the expression level or expression profile of the test sample is
compared with expression levels or an expression profile from a Th1 cell, similarity in
expression profile is an indication that the lymphocyte from which the test sample is
derived is a Th1 cell. A difference, on the other hand, would be an indication that the
sample is from a lymphocyte of another type or in another state (e.g., a Th2 cell).
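Where reference profiles are available for more than one known cell type, the classification can be sketched, for illustration only, as assigning the test sample to the closest reference. The root-mean-square distance on log2 expression values used below is an assumption chosen for the example and is not a comparison prescribed herein; the function name, marker names and values are likewise hypothetical.

```python
import math


def classify(test: dict[str, float], references: dict[str, dict[str, float]]) -> str:
    """Return the label of the reference profile closest to the test profile."""

    def distance(a: dict[str, float], b: dict[str, float]) -> float:
        shared = sorted(a.keys() & b.keys())
        if not shared:
            raise ValueError("profiles share no markers")
        # Root-mean-square difference of log2 expression over shared markers.
        total = sum((math.log2(a[m]) - math.log2(b[m])) ** 2 for m in shared)
        return math.sqrt(total / len(shared))

    return min(references, key=lambda label: distance(test, references[label]))
```

With hypothetical references `{"Th1": {"M1": 8.0, "M2": 1.0}, "Th2": {"M1": 1.0, "M2": 8.0}}`, a test profile of `{"M1": 6.0, "M2": 1.5}` would be assigned the label "Th1".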

VII. Treatment Methods 

Methods of treating a patient with asthma or other immune-mediated diseases are
also provided. These methods generally involve administering to the patient a
pharmaceutical composition, wherein the composition alters the expression or activity of at
least one gene listed in Table 2 or Table 6. In a preferred embodiment, the active
compound of the pharmaceutical composition is identified by a screening method of the
invention. In another preferred embodiment, the active compound of said pharmaceutical
composition is an antibody binding to at least one gene product of the genes listed in Table
2 or Table 6.

Both therapeutic and prophylactic methods are provided. In therapeutic methods,
a pharmaceutical composition is administered to a subject having or suspected to have an
immune-related disease in an amount sufficient to alleviate one or more symptoms of the
disease. In prophylactic methods, a pharmaceutical composition is administered to a
subject susceptible to, or otherwise at risk for developing an immune-related disease, in an
amount sufficient to reduce or arrest the development of the disease. The treatment can be
administered in a single dose, but more commonly is administered in several doses.

If the immune-related disease is a consequence of an excessive Th1 response,
then certain methods involve administering an agent that inhibits the expression of a gene
that is up-regulated during Th1 polarization or that inhibits the activity of the protein it
encodes. Alternatively, an agent can be administered that activates the expression of a
gene that is up-regulated in Th2 cells, or which increases the activity of the protein
encoded by such a gene. A third option is to administer one or more agents that achieve
both of these results.

If instead the disease is associated with an excessive Th2 response, then some
treatment strategies involve administering an agent that inhibits the expression or activity
of a gene that is up-regulated in Th2 cells. Alternatively, an agent is administered that
activates the expression or activity of a gene that is up-regulated in Th1 cells. Still other
methods involve providing an agent or agents that accomplish both of these results.

A number of methods that are known in the art can be utilized to modulate gene
expression or activity. Various agents can be used to inhibit gene expression or the activity
of the corresponding protein. Examples of such agents include antisense oligonucleotides,
ribozymes, triple helix structures and double-stranded RNA (dsRNA), particularly small
interfering RNAs (siRNAs). These agents are discussed in additional detail below.
Alternatively, compounds that antagonize the activity of the protein encoded by the
up-regulated genes can also be utilized. Examples include antibodies that specifically bind to
the encoded protein. Other antagonists are small molecules.

Various options are also available for increasing gene expression or the activity
of the protein encoded by a gene. One option is to administer a nucleic acid that encodes
the protein whose activity one seeks to increase. This nucleic acid is operably linked to
appropriate expression control elements to facilitate its expression in lymphocytes. Another
option is to administer the protein itself, or an active fragment thereof. Yet another option
is to administer an agonist that increases the activity of the protein.

VIII. Compounds for Inhibiting or Enhancing the Synthesis or Activity of Control
Genes

A. Activity or Synthesis Inhibition 

As discussed above, expression of certain genes can cause or worsen the
symptoms of an immune-related disease. The increase in the expression or activity of such
control genes and their products can be countered using various methodologies to inhibit
their expression, synthesis or activity.

For example, antisense, ribozyme, triple helix molecules and antibodies can be
utilized to ameliorate the negative effects of such control genes and gene products.
Antisense RNA and DNA molecules act directly to block the translation of mRNA by
hybridizing to targeted mRNA, thereby blocking protein translation. Hence, a useful target
for antisense molecules is the translation initiation region.

Ribozymes are enzymatic RNA molecules that hybridize to specific sequences
and then carry out a specific endonucleolytic cleavage reaction. Thus, for effective use,
the ribozyme should include sequences that are complementary to the target mRNA, as
well as the sequence necessary for carrying out the cleavage reaction (see, e.g., U.S. Pat. No.
5,093,246).

Nucleic acids utilized to promote triple helix formation to inhibit transcription are
single-stranded and composed of deoxyribonucleotides. The base composition of such
polynucleotides is designed to promote triple helix formation via Hoogsteen base pairing
rules and typically requires significant stretches of either pyrimidines or purines on one
strand of a duplex.

Double-stranded RNA (dsRNA) inhibition methods can also be used to inhibit
expression of one or more of the differentially expressed nucleic acids. The RNA utilized
in such methods is designed such that at least a region of the dsRNA is substantially
identical to a region of a differentially expressed nucleic acid (e.g., a target gene); in some
instances, the region is 100% identical to the target. For use in mammals, the dsRNA is
typically about 19-30 nucleotides in length (i.e., small inhibitory RNAs (siRNAs) are
utilized). Methods and compositions useful for performing dsRNAi and siRNA are
discussed, for example, in PCT Publications WO 98/53083; WO 99/32619; WO 99/53050;
WO 00/44914; WO 01/36646; WO 01/75164; WO 02/44321; and published U.S. Patent
application 10/195,034, each of which is incorporated herein by reference in its entirety for
all purposes.
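
As an illustration of the sequence relationship described above, the following sketch enumerates candidate siRNA windows that are 100% identical to a region of a target mRNA, together with the complementary (antisense) strand for each window. It is a simplified sketch only: the function names and the example sequence are hypothetical, and practical siRNA design applies further selection criteria that are not modeled here.

```python
# Watson-Crick complements for RNA bases.
COMPLEMENT = {"A": "U", "C": "G", "G": "C", "U": "A"}

def reverse_complement_rna(seq):
    """Return the reverse complement of an RNA sequence (the antisense strand)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def sirna_candidates(mrna, length=21):
    """List (start, sense, antisense) for every window of the given length.

    The length must fall in the 19-30 nucleotide range mentioned in the text.
    DNA-style input (T) is converted to RNA (U).
    """
    if not 19 <= length <= 30:
        raise ValueError("siRNA length is expected to be 19-30 nucleotides")
    mrna = mrna.upper().replace("T", "U")
    candidates = []
    for start in range(len(mrna) - length + 1):
        sense = mrna[start:start + length]
        candidates.append((start, sense, reverse_complement_rna(sense)))
    return candidates

# Hypothetical target fragment, invented for illustration only.
target = "AUGGCUACGUAGCUAGCUAGGCUAACG"
candidates = sirna_candidates(target, length=21)
for start, sense, antisense in candidates[:2]:
    print(start, sense, antisense)
```

The same reverse-complement helper applies to antisense oligonucleotides, which are likewise designed to be complementary to the targeted mRNA region.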

Antibodies that have binding specificity for a target gene protein and that also interfere
with the activity of the gene protein can also be utilized to inhibit gene protein activity.
Such antibodies can be generated from full-length proteins or fragments thereof according
to the methods described below.

B. Activity Enhancement 

Immune-related diseases can be exacerbated by under-expression of certain
control genes and/or by a reduction in activity of a control gene product, for example.
Alternatively, the up-regulation of certain control gene products can produce a beneficial
effect. In any of these scenarios, it is useful to increase the expression, synthesis or
activity of such control genes and proteins.

These goals can be achieved, for example, by increasing the level of control gene
product or the concentration of active gene product. In one approach, a control gene
protein in the form of a pharmaceutical composition such as that described below is
administered to a subject suffering from an immune-related disease. Alternatively, DNA
sequences encoding control gene proteins can be administered to a patient at a
concentration sufficient to treat an immune-related disease or to treat an individual at risk
for such a disease. Gene therapy is yet another option and includes inserting one or more
copies of a normal control gene, or a fragment thereof capable of producing a functional
control protein, into lymphocytic cells using various vectors. Suitable vectors include, for
example, adenovirus, adeno-associated virus and retrovirus vectors. Liposomes and other
particles capable of introducing DNA into cells can also be utilized in some instances.
Cells, typically autologous cells, that express a normal control gene can then be introduced
or reintroduced into a patient to treat the immune-related disease.

X. Antibodies 

Antibodies that are immunoreactive with polypeptides expressed from the
differentially expressed genes or fragments thereof are also provided. The antibodies can
be polyclonal antibodies, distinct monoclonal antibodies or pooled monoclonal antibodies
with different epitopic specificities.

A. Production of Antibodies 
The antibodies can be prepared using intact polypeptide or fragments containing
antigenic determinants from proteins encoded by differentially expressed genes or target
genes as the immunizing antigen. The polypeptide used to immunize an animal can be
from natural sources, derived from translated cDNA, or prepared by chemical synthesis. In
some instances the polypeptide is conjugated with a carrier protein. Commonly used
carriers include keyhole limpet hemocyanin (KLH), thyroglobulin, bovine serum albumin
(BSA), and tetanus toxoid. The coupled peptide is then used to immunize the animal
(e.g., a mouse, a rat, or a rabbit). Various adjuvants can be utilized to increase the
immunological response, depending on the host species, and include, but are not limited to,
Freund's (complete and incomplete), mineral gels such as aluminum hydroxide, surface
active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil
emulsions, dinitrophenol and carrier proteins, as well as human adjuvants such as BCG
(bacille Calmette-Guerin) and Corynebacterium parvum.

Monoclonal antibodies can be made from antigen-containing fragments of the
protein by the hybridoma technique, for example, of Kohler and Milstein (Nature,
256:495-497, (1975); and U.S. Pat. No. 4,376,110, incorporated by reference in their
entirety). See also, Harlow & Lane, Antibodies, A Laboratory Manual (C.S.H.P., NY,
1988), incorporated by reference in its entirety. The antibodies can be of any
immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any subclass thereof.

Techniques for generation of human monoclonal antibodies have also been
described, including, for example, the human B-cell hybridoma technique (Kosbor et al.,
Immunology Today 4:72 (1983), incorporated by reference in its entirety); for a review, see
also, Larrick et al., U.S. Pat. No. 5,001,065 (incorporated by reference in its entirety). An
alternative approach is the generation of humanized antibodies by linking the
complementarity-determining regions or CDR regions (see, e.g., Kabat et al., "Sequences
of Proteins of Immunological Interest," U.S. Dept. of Health and Human Services, (1987);
and Chothia et al., J. Mol. Biol. 196:901-917 (1987)) of non-human antibodies to human
constant regions by recombinant DNA techniques. See Queen et al., Proc. Natl. Acad. Sci.
USA 86:10029-10033 (1989) and WO 90/07861 (incorporated by reference in its entirety).
Alternatively, one can isolate DNA sequences that encode a human monoclonal antibody
or a binding fragment thereof by screening a DNA library from human B cells according to
the general protocol set forth by Huse et al., Science 246:1275-1281 (1989) and then
cloning and amplifying the sequences which encode the antibody (or binding fragment) of
the desired specificity. The protocol described by Huse is rendered more efficient in
combination with phage display technology. See, e.g., Dower et al., WO 91/17271 and
McCafferty et al., WO 92/01047 (each of which is incorporated by reference). Phage
display technology can also be used to mutagenize CDR regions of antibodies previously
shown to have affinity for the peptides of the present invention. Antibodies having
improved binding affinity are selected.

Techniques developed for the production of "chimeric antibodies" by splicing the
genes from a mouse antibody molecule of appropriate antigen specificity together with
genes from a human antibody molecule of appropriate antigen specificity can be used. A
chimeric antibody is a molecule in which different portions are derived from different
species, such as those having a variable region derived from a murine monoclonal antibody
and a human immunoglobulin constant region. Single chain antibodies specific for the
differentially expressed gene products of the invention can be produced according to
established methodologies (see, e.g., U.S. Pat. No. 4,946,778; Bird, Science 242:423-426
(1988); Huston et al., Proc. Natl. Acad. Sci. USA 85:5879-5883 (1988); and Ward et al.,
Nature 334:544-546 (1989), each of which is incorporated by reference in its entirety).
Single chain antibodies are formed by linking the heavy and light chain fragments of the
Fv region via an amino acid bridge, resulting in a single chain polypeptide.

Antibodies can be further purified, for example, by binding to and elution from a
support to which the polypeptide or a peptide to which the antibodies were raised is bound.
A variety of other techniques known in the art can also be used to purify polyclonal or
monoclonal antibodies (see, e.g., Coligan et al., Unit 9, Current Protocols in Immunology,
Wiley Interscience, (1994), incorporated herein by reference in its entirety).

Anti-idiotype technology can also be utilized in some instances to produce
monoclonal antibodies that mimic an epitope. For example, an anti-idiotypic monoclonal
antibody made to a first monoclonal antibody will have a binding domain in the
hypervariable region that is the "image" of the epitope bound by the first monoclonal
antibody.

B. Use of Antibodies 
The antibodies that are provided are useful, for example, in screening cDNA
expression libraries and for identifying clones containing cDNA inserts which encode
structurally-related, immunocrossreactive proteins. See, for example, Aruffo & Seed,
Proc. Natl. Acad. Sci. USA 84:8573-8577 (1987) (incorporated by reference in its entirety).
Antibodies are also useful to identify and/or purify immunocrossreactive proteins that are
structurally related to the native polypeptide or to fragments thereof used to generate the
antibody. The antibodies can also be used to form antibody arrays to detect proteins
expressed by the differentially expressed genes.

The antibodies can also be used in the detection of differentially expressed genes,
such as control and fingerprint gene products. Various diagnostic assays can be utilized,
including but not limited to, competitive binding assays, direct or indirect sandwich assays
and immunoprecipitation assays (see, e.g., Monoclonal Antibodies: A Manual of
Techniques, CRC Press, Inc. (1987) pp. 147-158). When utilized in diagnostic assays, the
antibodies are typically labeled with a detectable moiety. The label can be any molecule
capable of producing, either directly or indirectly, a detectable signal. Suitable labels
include, for example, radioisotopes (e.g., 3H, 14C, 32P, 35S, 125I), fluorophores (e.g.,
fluorescein and rhodamine dyes and derivatives thereof), chromophores, chemiluminescent
molecules, and enzymes that act on a detectable substrate (including luciferase, alkaline
phosphatase, beta-galactosidase and horseradish peroxidase, for example). The antibodies
can also be utilized in the development of antibody arrays.

As noted above, antibodies are useful in inhibiting the expression products of the
differentially expressed genes and are valuable in inhibiting the action of certain control
gene products (e.g., target gene products identified as causing or exacerbating tumor or
cancer formation). Hence, the antibodies also find utility in a variety of therapeutic
applications.

XI. Pharmaceutical Compositions 

The invention also provides a pharmaceutical composition that can modulate the
expression or activity of at least one gene listed in one or more of the tables (e.g., Table 2
or Table 6) for use in prophylaxis or treatment of asthma or other immune-mediated
disease. In a preferred embodiment said pharmaceutical composition comprises an
antibody binding to at least one gene product of the genes listed in Table 2 or Table 6 as an
active ingredient.

A. Composition 
The pharmaceutical compositions used for treatment of cancers and tumors
comprise an active ingredient such as the inhibitory or activity-enhancing compounds
described herein and, optionally, various other components.

Thus, for example, the compositions can also include, depending on the
formulation desired, pharmaceutically-acceptable, non-toxic carriers or diluents, which are
defined as vehicles commonly used to formulate pharmaceutical compositions for animal
or human administration. The diluent is selected so as not to affect the biological activity
of the combination. Examples of such diluents are distilled water, buffered water,
physiological saline, PBS, Ringer's solution, dextrose solution, and Hank's solution. In
addition, the pharmaceutical composition or formulation can include other carriers,
adjuvants, or non-toxic, nontherapeutic, nonimmunogenic stabilizers, excipients and the
like. The compositions can also include additional substances to approximate
physiological conditions, such as pH adjusting and buffering agents, toxicity adjusting
agents, wetting agents, detergents and the like.

The composition can also include any of a variety of stabilizing agents, such as an
antioxidant for example. When the pharmaceutical composition includes a polypeptide,
the polypeptide can be complexed with various well known compounds that enhance the in
vivo stability of the polypeptide, or otherwise enhance its pharmacological properties (e.g.,
increase the half-life of the polypeptide, reduce its toxicity, enhance solubility or uptake).
Examples of such modifications or complexing agents include the production of sulfate,
gluconate, citrate, phosphate and the like. The polypeptides of the composition can also be
complexed with molecules that enhance their in vivo attributes. Such molecules include,
for example, carbohydrates, polyamines, amino acids, other peptides, ions (e.g., sodium,
potassium, calcium, magnesium, manganese), and lipids.

Further guidance regarding formulations that are suitable for various types of
administration can be found in Remington's Pharmaceutical Sciences, Mack Publishing
Company, Philadelphia, PA, 17th ed. (1985). For a brief review of methods for drug
delivery, see, Langer, Science 249:1527-1533 (1990).

B. Dosage 

The pharmaceutical compositions can be administered for prophylactic and/or
therapeutic treatments. The active ingredient in the pharmaceutical compositions typically
is present in a therapeutic amount, which is an amount sufficient to slow or reverse tumor
formation, to eliminate the tumor, or to remedy symptoms associated with the tumor or
cancer. Toxicity and therapeutic efficacy of the active ingredient can be determined
according to standard pharmaceutical procedures in cell cultures and/or experimental
animals, including, for example, determining the LD50 (the dose lethal to 50% of the
population) and the ED50 (the dose therapeutically effective in 50% of the population).
The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be
expressed as the ratio LD50/ED50. Compounds that exhibit large therapeutic indices are
preferred.
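
The therapeutic index calculation can be sketched as follows. The compound names and dose values are hypothetical and serve only to show how the ratio LD50/ED50 ranks candidates; they are not data from this specification.

```python
def therapeutic_index(ld50, ed50):
    """Return LD50/ED50; both doses must be positive and in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50

# Hypothetical (LD50, ED50) pairs in mg/kg, invented for illustration only.
compounds = {"compound_A": (500.0, 25.0), "compound_B": (200.0, 50.0)}
indices = {name: therapeutic_index(*doses) for name, doses in compounds.items()}

# The compound with the largest therapeutic index is preferred.
preferred = max(indices, key=indices.get)
print(indices, preferred)
```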

The data obtained from cell culture and/or animal studies can be used in
formulating a range of dosages for humans. The dosage of the active ingredient typically
lies within a range of circulating concentrations that include the ED50 with little or no
toxicity. The dosage can vary within this range depending upon the dosage form employed
and the route of administration utilized.

In prophylactic applications, compositions containing the compounds that are
provided are administered to a patient susceptible to or otherwise at risk of tumor
formation. Such an amount is defined to be a "prophylactically effective" amount or dose.
In this use, the precise amount depends on the patient's state of health and weight.
Typically, the dose ranges from about 1 to 500 mg of purified protein per kilogram of body
weight, with dosages of from about 5 to 100 mg per kilogram being more commonly
utilized.

C. Administration
The active ingredient, alone or in combination with other suitable components,
can be made into aerosol formulations (i.e., they can be "nebulized") to be administered
via inhalation. Aerosol formulations can be placed into pressurized acceptable propellants,
such as dichlorodifluoromethane, propane and nitrogen.

Suitable formulations for rectal administration include, for example,
suppositories, which consist of the packaged active ingredient with a suppository base.
Suitable suppository bases include natural or synthetic triglycerides or paraffin
hydrocarbons. In addition, it is also possible to use gelatin rectal capsules, which consist
of a combination of the packaged nucleic acid with a base, including, for example, liquid
triglycerides, polyethylene glycols, and paraffin hydrocarbons.

Formulations suitable for parenteral administration, such as, for example, by
intraarticular (in the joints), intravenous, intramuscular, intradermal, intraperitoneal, and
subcutaneous routes, include aqueous and non-aqueous, isotonic sterile injection solutions,
which can contain antioxidants, buffers, bacteriostats, and solutes that render the
formulation isotonic with the blood of the intended recipient, and aqueous and
non-aqueous sterile suspensions that can include suspending agents, solubilizers, thickening
agents, stabilizers, and preservatives. In the practice of this invention, compositions can be
administered, for example, by intravenous infusion, orally, topically, intraperitoneally,
intravesically or intrathecally. Formulations for injection can be presented in unit dosage
form, e.g., in ampules or in multidose containers, with an added preservative. The
compositions are formulated as sterile, substantially isotonic and in full compliance with
all Good Manufacturing Practice (GMP) regulations of the U.S. Food and Drug
Administration.

XII. Devices for Detecting Differentially Expressed Nucleic Acids

A. Customized Probe Arrays 

1. Probes for Differentially Expressed Genes 

The differentially expressed genes that are provided can be utilized to prepare
custom probe arrays for use in screening and diagnostic applications. In general, such
arrays include probes such as those described above in the section on differentially
expressed nucleic acids, and thus include probes complementary to full-length
differentially expressed nucleic acids (e.g., cDNA arrays) and shorter probes that are
typically 10-30 nucleotides long (e.g., synthesized arrays). Typically, the arrays include
probes capable of detecting a plurality of the differentially expressed genes of the
invention. For example, such arrays generally include probes for detecting at least 2, 3, 4,
5, 6, 7, 8, 9 or 10 differentially expressed nucleic acids. For more complete analysis, the
arrays can include probes for detecting at least 12, 14, 16, 18 or 20 differentially expressed
nucleic acids. In still other instances, the arrays include probes for detecting at least 25,
30, 35, 40, 45 or all the differentially expressed nucleic acids that are identified herein.

2. Control Probes 

(a) Normalization Controls 

Normalization control probes are typically perfectly complementary to one or
more labeled reference polynucleotides that are added to the nucleic acid sample. The
signals obtained from the normalization controls after hybridization provide a control for
variations in hybridization conditions, label intensity, reading and analyzing efficiency and
other factors that can cause the signal of a perfect hybridization to vary between arrays.
Signals (e.g., fluorescence intensity) read from all other probes in the array can be divided
by the signal (e.g., fluorescence intensity) from the control probes thereby normalizing the
measurements.
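
The normalization step can be sketched as below. The sketch assumes that, when several normalization control probes are present, their mean signal is used as the divisor; the text does not specify how multiple controls are combined, and the probe identifiers and intensities are hypothetical.

```python
def normalize_signals(signals, control_ids):
    """Divide each non-control probe signal by the mean control-probe signal.

    signals: dict mapping probe identifier to raw intensity.
    control_ids: identifiers of the normalization control probes.
    """
    controls = [signals[probe] for probe in control_ids]
    reference = sum(controls) / len(controls)
    if reference <= 0:
        raise ValueError("control signal must be positive")
    return {probe: value / reference
            for probe, value in signals.items()
            if probe not in control_ids}

# Hypothetical fluorescence intensities from one array (illustration only).
raw = {"ctrl1": 1000.0, "ctrl2": 3000.0, "geneA": 4000.0, "geneB": 500.0}
normalized = normalize_signals(raw, control_ids={"ctrl1", "ctrl2"})
print(normalized)
```

Because every array is scaled by its own control signal, normalized values from different arrays become comparable even when overall hybridization or labeling efficiency differs.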

Virtually any probe can serve as a normalization control. However, hybridization
efficiency can vary with base composition and probe length. Normalization probes can be
selected to reflect the average length of the other probes present in the array; however, they
can also be selected to cover a range of lengths. The normalization control(s) can also be
selected to reflect the (average) base composition of the other probes in the array.
Normalization probes can be localized at any position in the array or at multiple positions
throughout the array to control for spatial variation in hybridization efficiency.

(b) Mismatch Controls 

Mismatch control probes can also be provided; such probes function as
expression level controls or as normalization controls. Mismatch control probes are
typically employed in customized arrays containing probes matched to known mRNA
species. For example, certain arrays contain a mismatch probe corresponding to each
match probe. The mismatch probe is the same as its corresponding match probe except for
at least one position of mismatch. A mismatched base is a base selected so that it is not
complementary to the corresponding base in the target sequence to which the probe can
otherwise specifically hybridize. One or more mismatches are selected such that under
appropriate hybridization conditions (e.g., stringent conditions) the test or control probe can
be expected to hybridize with its target sequence, but the mismatch probe cannot hybridize
(or can hybridize to a significantly lesser extent). Mismatch probes can contain a central
mismatch. Thus, for example, where a probe is a 20 mer, a corresponding mismatch probe
can have the identical sequence except for a single base mismatch (e.g., substituting a G, a
C or a T for an A) at any of positions 6 through 14 (the central mismatch).
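
The construction of central mismatch probes for a 20 mer can be sketched as follows. Positions are counted from 1, as in the text; the function names and the example probe sequence are hypothetical.

```python
BASES = "ACGT"

def mismatch_probes(probe, position):
    """Return the three probes that differ from `probe` only at `position` (1-based)."""
    probe = probe.upper()
    if not 1 <= position <= len(probe):
        raise ValueError("position outside the probe")
    index = position - 1
    return [probe[:index] + base + probe[index + 1:]
            for base in BASES if base != probe[index]]

def central_mismatch_probes(probe, first=6, last=14):
    """Map each central position (6 through 14 by default) to its mismatch probes."""
    return {pos: mismatch_probes(probe, pos) for pos in range(first, last + 1)}

# Hypothetical 20-mer match probe, invented for illustration only.
match_probe = "ACGTACGTACGTACGTACGT"
central = central_mismatch_probes(match_probe)
print(central[10])
```

Each generated probe is identical to the match probe except for a single substituted base, so it differs from the match probe at exactly one position.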

(c) Sample Preparation, Amplification, and Quantitation Controls

Arrays can also include sample preparation/amplification control probes. Such
probes can be complementary to subsequences of control genes selected because they do
not normally occur in the nucleic acids of the particular biological sample being assayed.
Suitable sample preparation/amplification control probes can include, for example, probes
to bacterial genes (e.g., BioB) where the sample in question is a biological sample from a
eukaryote.

The RNA sample can then be spiked with a known amount of the nucleic acid to
which the sample preparation/amplification control probe is complementary before
processing. Quantification of the hybridization of the sample preparation/amplification
control probe provides a measure of alteration in the abundance of the nucleic acids caused
by processing steps. Quantitation controls are similar. Typically, such controls involve
combining a control nucleic acid with the sample nucleic acid(s) in a known amount prior
to hybridization. They are useful to provide a quantitative reference and permit
determination of a standard curve for quantifying hybridization amounts (concentrations).
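
A standard curve of the kind mentioned above can be sketched with an ordinary least-squares line fitted to the signals of quantitation controls spiked at known amounts. The sketch assumes a linear relationship between amount and signal over the measured range, which the text does not state; the amounts, signals and units below are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def estimate_amount(signal, slope, intercept):
    """Invert the standard curve to estimate the amount giving `signal`."""
    return (signal - intercept) / slope

# Hypothetical spiked control amounts (arbitrary units) and measured signals.
spiked_amounts = [1.0, 2.0, 4.0, 8.0]
control_signals = [150.0, 250.0, 450.0, 850.0]

slope, intercept = fit_line(spiked_amounts, control_signals)
unknown_amount = estimate_amount(650.0, slope, intercept)
print(slope, intercept, unknown_amount)
```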

3. Array Synthesis

Nucleic acid arrays for use in the present invention can be prepared in two
general ways. One approach involves binding DNA from genomic or cDNA libraries to
some type of solid support, such as glass for example. (See, e.g., Meier-Ewart, et al.,
Nature 361:375-376 (1993); Nguyen, C. et al., Genomics 29:207-216 (1995); Zhao, N. et
al., Gene, 158:207-213 (1995); Takahashi, R, et al., Gene 164:219-227 (1995); Schena, et
al., Science 270:467-470 (1995); Southern et al., Nature Genetics Supplement 21:5-9
(1999); and Cheung, et al., Nature Genetics Supplement 21:15-19 (1999), each of which is
incorporated herein in its entirety for all purposes.)

The second general approach involves the synthesis of nucleic acid probes. One
method involves synthesis of the probes according to standard automated techniques and
then post-synthetic attachment of the probes to a support. See for example, Beaucage,
Tetrahedron Lett., 22:1859-1862 (1981) and Needham-VanDevanter, et al., Nucleic Acids
Res., 12:6159-6168 (1984), each of which is incorporated herein by reference in its
entirety. A second broad category is the so-called "spatially directed" polynucleotide
synthesis approach. Methods falling within this category further include, by way of
illustration and not limitation, light-directed polynucleotide synthesis, microlithography,
application by ink jet, microchannel deposition to specific locations and sequestration by
physical barriers.

Light-directed combinatorial methods for preparing nucleic acid probes are
described in U.S. Pat. Nos. 5,143,854 and 5,424,186 and 5,744,305; PCT patent
publication Nos. WO 90/15070 and 92/10092; EP 476,014; Fodor et al., Science 251:767-777
(1991); Fodor, et al., Nature 364:555-556 (1993); and Lipshutz, et al., Nature Genetics
Supplement 21:20-24 (1999), each of which is incorporated herein by reference in its
entirety. These methods entail the use of light to direct the synthesis of polynucleotide
probes in high-density, miniaturized arrays. Algorithms for the design of masks to reduce
the number of synthesis cycles are described by Hubbel et al., U.S. 5,571,639 and U.S.
5,593,839, and by Fodor et al., Science 251:767-777 (1991), each of which is incorporated
herein by reference in its entirety.

Other combinatorial methods that can be used to prepare arrays for use in the
current invention include spotting reagents on the support using ink jet printers. See Pease
et al., EP 728,520, and Blanchard, et al., Biosensors and Bioelectronics 11:687-690 (1996),
which are incorporated herein by reference in their entirety. Arrays can also be
synthesized utilizing combinatorial chemistry by utilizing mechanically constrained
flowpaths or microchannels to deliver monomers to cells of a support. See Winkler et al.,
EP 624,059; WO 93/09668; and U.S. Pat. No. 5,885,837, each of which is incorporated
herein by reference in its entirety.

4. Array Supports

Supports can be made of any of a number of materials that are capable of
supporting a plurality of probes and compatible with the stringency wash solutions.
Examples of suitable materials include, for example, glass, silica, plastic, nylon or
nitrocellulose. Supports are generally rigid and have a planar surface. Supports
typically have from 1-10,000,000 discrete spatially addressable regions, or cells. Supports
having 10-1,000,000 or 100-100,000 or 1000-100,000 regions are common. The density of
cells is typically at least 1000, 10,000, 100,000 or 1,000,000 regions within a square
centimeter. Each cell includes at least one probe; more frequently, the various cells
include multiple probes. In general each cell contains a single type of probe, at least to the
degree of purity obtainable by synthesis methods, although in other instances some or all
of the cells include different types of probes. Further description of array design is set
forth in WO 95/11995, EP 717,113 and WO 97/29212, which are incorporated by
reference in their entirety.

XIII. Kits

Kits containing components necessary to conduct the screening and diagnostic
methods of the invention are also provided. Some kits include a plurality of
probes that hybridize under stringent conditions to the different differentially expressed
nucleic acids that are provided. Other kits include a plurality of different primer pairs,
each pair selected to effectively prime the amplification of a different differentially
expressed nucleic acid. In the case when the kit includes probes for use in quantitative
RT-PCR, the probes can be labeled with the requisite donor and acceptor dyes, or these can be
included in the kit as separate components for use in preparing labeled probes.

The kits can also include enzymes for conducting amplification reactions such as
various polymerases (e.g., RT and Taq), as well as deoxynucleotides and buffers. Cells
capable of expressing one or more of the differentially expressed nucleic acids of the
invention can also be included in certain kits.

Typically, the different components of the kit are stored in separate containers.
Instructions for use of the components to conduct an analysis are also generally included.

The following examples are offered to illustrate certain aspects of the methods 
and devices that are provided; it should be understood that these examples are not to be 
construed to limit the claimed invention. 

EXAMPLES 

A. Materials and Methods

Induction of Th1 and Th2 in vitro polarization

The mononuclear cells were isolated from the cord blood of healthy human
neonates by Ficoll-Paque isolation (Amersham Pharmacia Biotech, Uppsala, Sweden).
The CD4+ cells were further purified using magnetic beads (Dynal, Oslo, Norway). The
primary activation was performed using plate-bound anti-CD3 (500-1000 ng/ul for coating)
and 500 ng/ul soluble anti-CD28 (Immunotech, Marseille, France). The cells were cultured at
a density of 0.5-2 x 10^6 cells/ml in Yssel's medium (Irvine Scientific, Santa Ana, CA)
containing 1% AB-serum (Red Cross, Helsinki, Finland). The cells were polarized
in either Th1 medium containing 2.5 ng/ml of IL-12 (R&D Systems,
Minneapolis, MN) or Th2 medium containing 10 ng/ml of IL-4 (R&D Systems), in the
presence or absence of 3 ng/ml TGFβ (R&D Systems). A portion of the cells was cultured in
"neutral conditions" without any polarizing cytokines. Samples were collected at
0 h, 2 h, 6 h and 48 h.

Real-Time quantitative RT-PCR 

Real-time quantitative RT-PCR was performed to check the quality of the cell
samples from the different treatments. The gene expression levels were measured for IFNγ,
GATA-3, T-bet and the housekeeping gene Elongation factor 1 alpha (EF1α), using a TaqMan
ABI Prism 7700 instrument (Applied Biosystems, Foster City, CA) as
described before (18). EF1α was used as a reference transcript. The expression of this
housekeeping gene remains stable during the differentiation of Th1 and Th2 cells (19).
Primers and probes (Table 4) used for the quantification of gene expression (MedProbe,
Norway) were designed using Primer Express software (Applied Biosystems).

Oligonucleotide array studies 

The total RNA of the samples was isolated using the Trizol method (Invitrogen
Co., Carlsbad, CA) and was further purified with Qiagen's RNeasy minikit (Qiagen,
Valencia, CA). 4-5 µg of total RNA was used as starting material for the sample preparation.
The sample preparation was performed according to the instructions and recommendations
provided by the probe array manufacturer (Affymetrix). The samples were hybridized to
HG-U95Av2 arrays containing probes for approximately 10,000 genes. The data were
analyzed on three consecutive levels. At the detection level, each probe was assigned a
call of present, absent, marginal, or no call. At the comparison level, a gene was defined
as upregulated if the signal log ratio between the reference and the target samples was
larger than one (2-fold increase) and the gene was called present in the target sample.
Similarly, a gene was defined as downregulated if the signal log ratio was less than minus one
(2-fold decrease) and the gene was called present in the reference sample. At the third level
of data analysis, genes that presented a consistent change in two separate experiments were
considered differentially expressed. The gene transcript levels were determined from data images
with the algorithms in the GeneChip Microarray Suite™ software (Affymetrix MAS5), and
the subsequent gene filtering and data analysis were done with Microsoft Excel 2002 and
Microsoft Access 2000.
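
The comparison-level and third-level rules described above can be illustrated with a short sketch. It is not the MAS5 software or the spreadsheet filters actually used; the function names and the string labels for the detection calls are hypothetical and chosen only for illustration.

```python
from typing import Optional


def classify_change(signal_log_ratio: float, reference_call: str, target_call: str) -> str:
    """Classify one reference-vs-target comparison for a single gene."""
    # Upregulated: signal log ratio above 1 (2-fold) and called present in the target sample.
    if signal_log_ratio > 1 and target_call == "present":
        return "up"
    # Downregulated: signal log ratio below -1 and called present in the reference sample.
    if signal_log_ratio < -1 and reference_call == "present":
        return "down"
    return "unchanged"


def consistent_change(first_experiment: str, second_experiment: str) -> Optional[str]:
    """Return the shared direction if both experiments agree on 'up' or 'down'."""
    if first_experiment == second_experiment and first_experiment in ("up", "down"):
        return first_experiment
    return None
```

Under these assumptions a gene would be kept as differentially expressed only when `consistent_change` returns a direction for the two separate experiments.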

Validation of the oligonucleotide array results 

For validation of the oligonucleotide array results with Real-Time RT-PCR,
additional Th1 and Th2 primary cultures were generated as previously described (18).
Briefly, the priming was performed in the presence of 100 ng/ml PHA (Murex Diagnostics,
Chatillon, France) and irradiated CD32-B7 transfected fibroblasts (20). The feeder cells
were added at a final density of 1 x 10^6 cells/ml. Th1 cultures were supplemented with
2.5 ng/ml of IL-12 (R&D Systems, Minneapolis, MN). Th2 cultures were supplemented
with 10 µg/ml of anti-IL-12 (R&D Systems) and 10 ng/ml of IL-4 (R&D Systems). After
48 hours of priming, 40 U/ml of IL-2 (R&D Systems) was added to the cultures to
enhance the proliferation of the lymphocytes. A portion of the cells was cultured without any
polarizing cytokines in the presence of IL-2 alone. The cultures were generated from four
individuals, and during polarization samples were collected at 0 h, 6 h, 24 h,
48 h and 7 d.

Mice, reagents and cell culture 

Stat6 deficient mice and control wild-type Balb/cJ mice were from Jackson
Laboratory (Bar Harbor, Maine). Splenic mononuclear cells were isolated with Ficoll-Paque
PLUS (Amersham Pharmacia Biotech, Uppsala, Sweden). CD4+ T lymphocytes
were further purified with magnetic CD4 MicroBeads (Miltenyi Biotech, Bergisch Gladbach,
Germany). The cells were activated with plate-bound anti-mouse CD3 (clone 500A2,
1.26 µg/ml, Pharmingen, San Diego, CA) and soluble anti-mouse CD28 (500 ng/ml;
Pharmingen, San Diego, CA) and were cultured in IDME media containing 10% FCS,
nonessential amino acids and 2-mercaptoethanol (all from Gibco BRL, Life Technologies,
Paisley, Scotland). RmIL-4 (10 ng/ml, Pharmingen, San Diego, CA) and anti-IL-12 and
anti-IFN-γ (both 10 µg/ml, Pharmingen, San Diego, CA) were added for Th2 development.
CD4+ T cells pooled from different mice were cultured as Th0 (with activation and anti-IL-4,
anti-IL-12 and anti-IFN-γ) or under Th2-differentiating conditions. Cells were harvested at
0h, 2h, 6h, 24h and 48h.

Oligonucleotide microarray analysis

Total RNA was extracted with Trizol (Gibco BRL, Life Technologies, Paisley,
Scotland) and further purified with the RNeasy Kit (Qiagen, Valencia, CA). cDNA was
synthesized with the Superscript II kit (Gibco BRL, Life Technologies, Paisley, Scotland) using
T7-(dT)24 as primer. Biotin-labelled cRNA was prepared by an in vitro transcription reaction
using the BioArray HighYield RNA Transcript Labelling Kit (Enzo Diagnostics, Inc.,
Farmingdale, NY) according to the manufacturer's protocol. The cRNA was purified, fragmented
and hybridized to Affymetrix MG-U74A GeneChip™ arrays. Arrays were stained and scanned
according to Affymetrix (Santa Clara, CA) protocols.

Affymetrix Microarray Suite™ (version 5.0) software was used for the data
analysis. In this program, Signal Log Ratio is used to describe the change between a target
and reference array. The change is expressed as a log2 ratio; therefore, a log2 ratio (Signal
Log Ratio) of 1 corresponds to a two-fold change. In this study, the probe sets were excluded if:
1) the detection call for both target and reference was Absent, 2) in comparison analysis, the
Change call was NC (No Change), or 3) the signal log ratio between target and
reference was between -1 and 1. Since there were five independent cell cultures to study IL4
inducible genes at 48h, a t-test was also used to compare the signals between the target
(Activation + IL4) and reference (Activation) groups, with 0.05 set as the cutoff
for the p-value.
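
The three exclusion criteria and the log2-to-fold-change relationship stated above can be sketched as follows. This is an illustration only: the function names are hypothetical, the single-letter call codes ("A" for Absent, "NC" for No Change) are assumed encodings, and treating the -1 to 1 interval as inclusive of its end points is an assumption. The t-test step is not reproduced here.

```python
def keep_probe_set(target_detection: str, reference_detection: str,
                   change_call: str, signal_log_ratio: float) -> bool:
    """Return True if a probe set survives the three exclusion criteria."""
    # 1) Excluded if detection is Absent for both target and reference.
    if target_detection == "A" and reference_detection == "A":
        return False
    # 2) Excluded if the comparison analysis gives a No Change call.
    if change_call == "NC":
        return False
    # 3) Excluded if the signal log ratio lies between -1 and 1 (assumed inclusive).
    if -1 <= signal_log_ratio <= 1:
        return False
    return True


def fold_change(signal_log_ratio: float) -> float:
    """Convert a log2 ratio to a fold change (a ratio of 1 gives 2-fold)."""
    return 2.0 ** signal_log_ratio
```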

Real-time quantitative PCR (TaqMan®) detection

The principle of TaqMan® detection has been described previously. Primers and
probes for TaqMan® detection were designed with Primer Express™ software (Applied
Biosystems, Foster City, CA) and made by MedProbe (Oslo, Norway). The sequences for
the primers and probes are listed in Table 5. Samples from three independent cultures
were measured in duplicate in two separate runs. The standard deviation of these values
had to be less than 0.5 C_T, where the C_T value is the number of PCR cycles required for
the fluorescence signal to exceed a fixed threshold. The relative expression
of target mRNA was normalized against EF1α:

ΔC_T = C_T(EF1α) - C_T(target)

ΔΔC_T = ΔC_T1 - ΔC_T2

where ΔC_T2 represents ΔC_T from untreated (time point 0h) Stat6+/+ and Stat6-/- CD4+ T
cells and ΔC_T1 represents ΔC_T from other time points. With this calculation,
ΔΔC_T is essentially equal to the log2 ratio; therefore a ΔΔC_T of 1 corresponds to a
2-fold change. The experiments were
performed using an ABI PRISM 7700 Sequence Detection System (Applied Biosystems).
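
As a worked illustration of the normalization above, the following sketch computes the EF1α-normalized difference, the difference relative to the untreated 0h sample, and the corresponding fold change. The function names and the C_T values used in the usage note are hypothetical, and applying the 0.5 cutoff to the sample standard deviation of the replicate C_T values is an assumption about how the criterion was evaluated.

```python
from statistics import stdev
from typing import Sequence


def delta_ct(ct_ef1a: float, ct_target: float) -> float:
    """Normalize the target against EF1a: C_T(EF1a) - C_T(target)."""
    return ct_ef1a - ct_target


def delta_delta_ct(delta_ct_timepoint: float, delta_ct_untreated: float) -> float:
    """Change relative to the untreated 0h sample (first minus second term)."""
    return delta_ct_timepoint - delta_ct_untreated


def fold_change(ddct: float) -> float:
    """The difference of differences behaves as a log2 ratio, so 1 means 2-fold."""
    return 2.0 ** ddct


def replicates_acceptable(ct_values: Sequence[float], max_sd: float = 0.5) -> bool:
    """Check that replicate C_T values have a standard deviation below the cutoff."""
    return stdev(ct_values) < max_sd
```

For example, with invented values of C_T(EF1α) = 18.0 at both time points and C_T(target) of 25.0 at 0h and 24.0 at a later time point, the two normalized values are -7.0 and -6.0, their difference is 1.0, and the fold change is 2.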
Immunofluorescence staining and FACS analysis

CD4+ T cells from Stat6+/+ and Stat6-/- mice were activated and cultured for
Th2 priming. At 48h, samples were collected and stained. Briefly, cells were washed with
PBS and then stained with FITC-conjugated rat anti-mouse Ly-6A/E monoclonal antibody
(BD Biosystem, San Diego, CA) or isotype control antibody at 4°C for 20 min. Cells were
washed twice and then analysed by FACScan with CellQuest software. At least 10,000 cells
were analysed for each sample.

B. Results 

In order to elucidate the genes involved in the early polarization and to explore
the inhibitory mechanism of TGFβ on the differentiation of Th1 and Th2 cells, we studied
the gene expression profiles of the CD4+ cells induced to Th1 and Th2 directions in the
absence and presence of TGFβ. First, the differential expression of the known Th1 and Th2
marker genes IFNγ, T-bet and GATA-3 was studied in the 48-hour samples to ensure that
the cells had been induced to differentiate to Th1 and Th2 directions (FIG. 1) (13, 16).
After that the samples were hybridized on HG-U95Av2 arrays representing probes for
approximately 10,000 mainly known human genes. According to the oligonucleotide array
results, activation via CD3/CD28 alone induced expression of 437 probe sets and repressed
361 probe sets at the 2-hour time point (data not shown). After 6 hours, expression of 832
probe sets was upregulated, whereas 856 probe sets were downregulated in response to
CD3/CD28 activation (data not shown). After 48 hours, 582 probe sets were induced
and 533 probe sets were repressed by CD3/CD28 activation compared to the Thp cells
(FIG. 2A). In the figure only the genes showing a 16-fold change in comparison between
activated and Thp cells are presented. In addition to genes regulated by activation,
numerous targets with various functions were identified as being regulated by the
cytokines IL-12, IL-4 and TGFβ (see Tables 1 and 2).

Targets of IL-12 and IL-4

The effects of IL-12 were modest during the immediate phase (2 and 6 hours) of
polarization, as over 2-fold changes were seen in expression of only 3 genes (Table 1). All
of these were classified as "not changed" by the MAS5 algorithm and thus might not be
reliable changes. After 48 hours the effects of IL-12 became clear, as 23 genes were
regulated by IL-12 (FIG. 2B). Altogether 40 genes were detected to be differentially
expressed by the cells induced to differentiate to Th1 or Th2 directions for 48 hours (Table
1 and FIG. 2F). Previously reported "marker genes" IFNG, IL-2, IL18RAP,
CTLA1/GZMB, G0S2, ANXA3, P2RX5, LIF and BLR1 were preferentially expressed by the
cells induced to polarize to the Th1 direction (21-23) and MAF/C-MAF, GATA-3, EBI2,
IL10RA, Cox-2, NTRK1, CXCR4, E4BP4/NFIL3 and IL-13 were confirmed to be
preferentially expressed by the cells induced to the Th2 direction (15, 16, 21, 22, 24-26).
Moreover, suppressor of cytokine signaling 1 (TIP3/SOCS-1) was significantly more
expressed by the cells polarized to the Th2 direction compared to those induced to the Th1
direction.

The effects of IL-4 were clear already after 2 hours of Th2 polarization, when 34
genes were regulated by IL-4 compared to CD3/CD28 activation (Table 1). After 6
hours, 38 genes were detected to be regulated by IL-4. After 48 hours IL-4 regulated
expression of 41 genes (Table 1 and FIG. 2C). These genes included both previously known and
previously undescribed IL-12 and IL-4 regulated genes. To our knowledge, regulation of the genes
PACE, MRF-1, FLOT1, MTMR1, GOSR2, AF055029, AL050166, AF070528, SCYC2 by
IL-12 and genes KIAA1013, ID2, KIAA0750, STK17B, HMGCS1, AL049940, AUH,
BCL2A1 and HIC by IL-4 has not been previously described.

Regulation of gene expression by TGFβ

TGFβ regulated expression of numerous genes after 2 or 6 hours in Th1 or Th2
polarizing conditions (Table 1). However, only a few of these genes (ID3, CCL20, RTP801,
LAMA3, R32184_3) were also regulated by IL-4 or IL-12. After 48 hours the antagonizing
effects of TGFβ on the genes regulated by IL-12 or IL-4 became more evident. The
presence of TGFβ in Th1 conditions induced the expression of 16 genes and repressed
expression of 6 genes (see Table 1 and FIG. 2D). These TGFβ target genes included
specific targets of TGFβ, but also genes regulated by IL-12 or IL-4. TGFβ antagonized the
effects of IL-12 by upregulating expression of TNFRSF9 and by repressing expression of
GZMB/CTLA-1, a gene induced by both IL-12 and IL-4. Interestingly, in Th1 conditions
TGFβ also downregulated the expression of IL-4-inducible genes NFIL3/E4BP4 and
SATB1 and induced expression of VIM, which was preferentially expressed by the cells
cultured in Th2 conditions when compared to those cultured in Th1 conditions.

In Th2 conditions TGFβ induced expression of 16 genes and repressed 15 genes
(see Table 1 and FIG. 2E). Again TGFβ regulated a specific set of its own targets, but
importantly it antagonized the effects of IL-4 by repressing a set of IL-4-inducible genes
(ID2, Cox-2, PLA2G4A, BCL2A1, NFIL3) and expression of GNAI1, which was
preferentially expressed in the cells cultured in Th2 conditions when compared to Th1
conditions.

Validation of a set of genes using Real-Time RT-PCR 
Oligonucleotide array results were obtained from two independent experiments.
Four of the genes (SATB1, DUSP6, E4BP4 and TIP3) identified as differentially
expressed by the cells cultured in Th1 and Th2 conditions were selected for TaqMan RT-PCR
analysis to further validate the results obtained with oligonucleotide arrays and to
follow the expression kinetics of this set of genes in Th1 and Th2 conditions
during one week of polarization from Thp cells. The results obtained with Real-Time
RT-PCR were concordant with the oligonucleotide array data (FIG. 3). Importantly,
NFIL3/E4BP4, TIP3/SOCS-1 and DUSP6 were differentially expressed in
the cells cultured in Th1 and Th2 conditions already after six hours of polarization and the
differences were maintained for at least the first two days of polarization. SATB1 was also
differentially expressed after six hours in three of the four individuals studied, and in all
individuals after one and two days of polarization. DUSP6 is expressed as two alternative
splicing variants, and thus expression of both forms was quantitated. According to the
results, only the long form is expressed by the cells studied. The expression of
GADD45β was also verified with RT-PCR, based on data analysis with the previous version
of the Affymetrix analysis program, MAS4™, which found the gene to be preferentially
expressed by the cells cultured in Th1 conditions compared to Th2 conditions, as described
before (27, 28). However, after reanalysis of the data with the updated MAS5 program,
GADD45β was excluded from the results and is therefore not presented in FIG. 2. The
RT-PCR analysis confirmed the differences in the expression of GADD45β in the cells
cultured in Th1 or Th2 conditions.

IL4 inducible genes in murine CD4+ T lymphocytes

To identify IL4 inducible genes in murine CD4+ T lymphocytes, CD4+ T cells
were isolated from Stat6+/+ mice (Balb/cJ mice) and were activated with plate-bound
anti-CD3 and soluble anti-CD28. IL4, anti-IL12 and anti-IFNγ were added for Th2
development. By comparing microarray data from Th2 (IL4+activation) vs. Th0
(activation alone) at 48h, IL4 inducible genes were identified. From 5 independent cell
cultures, 117 probe sets (116 known genes) were regulated by IL4; 26 of the 117 were
up-regulated and 91 were down-regulated (Table 3). The differentially expressed
genes were divided into 6 groups: immune response and related, transcription regulation and
DNA binding, enzymes and inhibitors, apoptosis and cell proliferation, cell surface and
structure proteins, and miscellaneous.

Stat6 regulated murine genes

As T cells from Stat6-/- mice are deficient in Th2 development, we
isolated both Stat6-/- and Stat6+/+ CD4+ T cells, and activated and cultured them for Th2
polarization. At 48h, cells were harvested; RNA was isolated and subjected to DNA
microarray analysis. Stat6 target genes were identified by comparing DNA microarray
data obtained from Stat6-/- Th2 (IL4+activation) cells to data obtained from Stat6+/+ Th2
(IL4+activation) cells. 37 probe sets (36 known genes) were differentially expressed. 17
(including Stat6) of 37 probe sets were down-regulated and 20 probe sets were
downregulated in Stat6 deficient cells as compared to the wild type cells (Table 3). Within
these genes, 20 are induced by IL4, including the known Stat6 target genes IL4 and
Gata3. Hipk2 and Nfil3 have been shown to be Stat6 target genes also in B cells (29).
Stat6 is able to compete with Stat1 for its binding to the IFNγ promoter and inhibit its
expression (30, 31, 32). It is therefore consistent that a number of IFN regulated genes,
such as Ifi205, Ifi203, Isg15 and Isg20, were up-regulated in Stat6-/- CD4+ T cells.
Expression of certain genes known to be expressed mainly in Th1 cells or induced by
IFNγ, including Il18r1, CCR5, Gzmb, Ly6a, and Txk, was inhibited by Stat6.

20 of 37 known Stat6 target genes are induced by IL4, whereas 20 of 117 IL4
inducible genes are regulated via Stat6 (Table 3). Because other signal pathways are also
involved in IL4 signalling (33), it is evident that not all the IL4 inducible genes are
regulated by Stat6. Besides the 20 IL4 inducible and Stat6 regulated genes, there were 17
Stat6 regulated genes that are not induced by IL4.
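
The overlap arithmetic in the preceding paragraph can be checked with simple set operations. The sketch below uses placeholder identifiers rather than the actual entries of Table 3, sized only to reproduce the counts stated above (117 IL4 inducible, 37 Stat6 regulated, 20 shared); the function name is hypothetical.

```python
from typing import Dict, Set


def overlap_summary(il4_inducible: Set[str], stat6_regulated: Set[str]) -> Dict[str, int]:
    """Count entries shared between, and unique to, the two lists."""
    shared = il4_inducible & stat6_regulated
    return {
        "il4_and_stat6": len(shared),
        "stat6_only": len(stat6_regulated - il4_inducible),
        "il4_only": len(il4_inducible - stat6_regulated),
    }


# Placeholder identifiers (not real probe sets): 117 IL4 inducible entries
# and 37 Stat6 regulated entries, constructed so that 20 are shared.
il4_inducible = {f"entry_{i}" for i in range(117)}
stat6_regulated = {f"entry_{i}" for i in range(97, 134)}

summary = overlap_summary(il4_inducible, stat6_regulated)
```

With these placeholder sets the 37 Stat6 regulated entries split into 20 shared with the IL4 inducible list and 17 not induced by IL4, matching the counts in the text.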

Kinetic study of IL4 inducible and Stat6 regulated genes in mice

In order to elucidate the molecular mechanisms of T cell differentiation at a very
early stage and also to follow the early changes for those IL4 and Stat6 regulated genes
identified at 48h, the kinetics of gene expression profiles at early T cell polarization were
studied by using oligonucleotide arrays (Affymetrix). At early time points (2h and 6h), a
group of novel IL4 inducible genes was identified, including Bcl2, Atf3, Pole2, Cish and
Crabp2. Il4ra and Pros1, two known IL4 inducible genes, were induced by IL4 already at
early time points. Fig1 was induced by IL4 at 2h and remained upregulated at 24h.
Interestingly, more genes were induced by IL4 at early time points, whereas
more genes were suppressed by IL4 at 48h. Most Th1 marker genes or interferon
inducible genes, such as Il18r1, Ifi203, Ifi204, Ifi205, Ifng, Ifit1,
became differentially expressed at 24h or 48h.

The kinetic study revealed a group
of early Stat6 regulated genes, such as Pole2, Ppp3cc, Fig1, Pros1, Casp6, Il4ra and IL4.
All of these genes were also IL4 inducible genes. At 24h and 48h, Myo6, Cmkbr8
(CCR8), Crabp2, Hipk2, Atf3, Nfil3 and Zfp118 were identified as Stat6 regulated genes.
The novel IL4 inducible or Stat6 target genes in murine cells identified in this
study are summarized in Table 6.

TaqMan and FACS analysis confirms the microarray results

To verify the findings from microarray data, selected genes were further studied
by real-time quantitative RT-PCR. Based on microarray results, Hipk2, Nfil3, Zfp118 and
Atf3 are induced by IL4 and are regulated by Stat6. Expression of Ifi203 is inhibited by
IL4 via Stat6. Expression of these genes was measured using real-time PCR during the first
two days of CD4+ T cell differentiation using samples from three independent cell cultures.
In Stat6+/+ CD4+ T cells the expression of Hipk2 was induced by IL4 quickly (2h) and the
difference in Hipk2 expression between Th1 and Th2 cells further increased at 24h and
48h. Hipk2 expression was lower in Stat6-deficient cells compared to wild type cells at all
the time points. Similar results were obtained for Nfil3 and Zfp118. Atf3 was induced by
IL4 already at 2h. In Stat6-/- CD4+ T cells, the expression level of Atf3 in Th2 conditions
was lower than in Stat6+/+ T cells. In contrast, the expression of Ifi203 was inhibited by
IL4. In Stat6-deficient cells its expression level was increased, and the
largest difference was seen quite late (48h). We consider that Ifi203 could be a secondary
target of Stat6. Together, these results show that these genes were induced or repressed by
IL4 through Stat6.

The Ly-6 family of cell surface molecules has previously been shown to
participate in T cell activation. The expression of Ly-6A/E is upregulated on normal
murine T and B cells by IFN-gamma. The protein expression of Ly6a in Th2 cells polarized
for 48h was measured by FACScan using FITC-conjugated rat anti-mouse Ly-6A/E
monoclonal antibody. As expected based on microarray results, at 48h the protein
expression of Ly6a was upregulated (90%) in Stat6-/- cells cultured in Th2 conditions
compared to wild type cells (45%) cultured under the same conditions.

C. Discussion

Our study demonstrates that triggering of the T cell receptor leads to regulation of
more than 1000 genes with various functions. Compared to that number, the group of
target genes regulated by cytokines mediating the differentiation process is specific and
limited to fewer than 100 genes, at least at this stage of differentiation.

Comparison of the cells cultured in Th1 or Th2 conditions to each other or to
CD3/CD28-activated cells revealed changes in the expression of 124 genes. Of these genes
approximately 77 have not been previously associated with Th1 and Th2 polarization.
Differential regulation of genes DUSP6, E4BP4/NFIL3, SATB1 and TIP3/SOCS-1 was
further confirmed using Real-Time RT-PCR. Preferential induction of these genes by IL-4
already after 2 hours of Th2 polarization positions them as important candidate
upstream regulators of the early differentiation process. The roles of these genes in Th1
and Th2 differentiation are currently unknown. DUSP6 is a phosphatase that inhibits the
activity of ERK2 (35). E4BP4/NFIL3 is an inducer or repressor of transcription, which
can activate IL-3 expression, and a binding site for this factor is also present in the
promoter area of IFNγ. In pro-B lymphocytes E4BP4/NFIL3 has been found to participate
in preventing apoptosis in response to IL-3 through ras-mediated signaling, which involves
activation of both PI3K and raf/MAPK pathways (36, 37). SATB1 is a DNA binding
protein, which is known to be involved in the development of thymic T cells (38). Its
cleavage by Caspase-6 and resulting dissociation from chromatin is involved in nuclear
degradation occurring during early apoptosis of T cells (39). TIP3/SOCS-1, induced by the
cytokine/STAT pathway, is an inhibitor of cytokine signaling. Previous reports have
provided conflicting information concerning the role of TIP3/SOCS-1 in Th1 and Th2
cells. On one hand, the protein has been shown to be preferentially expressed by Th1
cells in mouse (40). On the other hand, it has been reported that TIP3/SOCS-1 is induced
by IL-6 in human, which promotes the Th2 and inhibits the Th1 commitment (40, 41).
DUSP6, SATB1, E4BP4/NFIL3 and TIP3/SOCS-1 are likely to be among those upstream
factors that respond first to the polarizing signals and thus are involved in determining the
fate of Thp cells during the early stages of polarization.

It has been previously shown that TGFβ inhibits the differentiation of Th1 and
Th2 subtypes (42, 43). Studies with mice have shown that the inhibition of Th1 and Th2
differentiation by TGFβ occurs through suppressing T-bet and GATA-3 expression,
respectively (44-46). In our study with human cells these two genes were not among the
numerous primary genes regulated by TGFβ, indicating that the mechanism of TGFβ
regulation of Th1 and Th2 responses is probably more complex than has been previously
thought. The effects of TGFβ can also vary depending on the stage of cell differentiation,
the cytokine environment and the concentration of TGFβ used in the cultures (47).

Enhanced Th2 response is known to contribute to the phenotype and symptoms of
asthma (1-2). On the other hand, recent studies with mouse models have demonstrated that
TGFβ1 can suppress the airway hyper-responsiveness and airway inflammation associated
with asthma (48, 49). However, the molecular mechanism of this suppression is not
known. One hypothesis is that TGFβ inhibits the pathogenesis of asthma
through its ability to suppress Th2 responses. If this hypothesis is correct, the genes that
are regulated by TGFβ to suppress Th2 development are particularly interesting. Our
results demonstrate that TGFβ regulates a set of the same target genes as IL-12 and IL-4
(ID3, CCL20, RTP801, LAMA3, R32184_3, TNFRSF9, E4BP4, CTLA1/GZMB, ID2, Cox-2,
GNAI1, PLA2G4A, BCL2A1). Interestingly, most of these genes coregulated by IL-12/IL-4
and TGFβ are known to participate in the signaling events involved in apoptosis or
survival. The IL-12 or IL-4 antagonizing influence of TGFβ on the expression of these
genes could partly explain the inhibitory effect of TGFβ on differentiation. If regulation of
the expression of these genes can regulate the differentiation process, they might be critical
factors for the polarization.

In addition to genes regulated by IL-12 and IL-4, TGFβ also regulated many
other interesting targets such as DUSP4, IL-9, IL1RN and LGAL3. DUSP4, which is known
to inhibit ERK1/2, JNK1/2 and c-jun activity, was upregulated by TGFβ in Th1 conditions
(50-51). JNK1 and JNK2 have been associated with Th1 and Th2 differentiation (52-53).
Studies with knockout mice have demonstrated that in the absence of JNK1, the
differentiation of Th1 and Th2 subsets is impaired and the cells preferentially polarize to
Th2 subtypes. JNK2 seemed to be required for proper Th1 differentiation. Inhibition of
JNK2 activity through DUSP4 could provide one possible mechanism responsible for the
inhibition of Th1 differentiation by TGFβ. Interestingly, in Th2 conditions TGFβ induced
expression of certain genes that have been previously connected to asthma and other
inflammatory diseases. These genes included IL1RN, IL-9 and LGAL3. IL1RN is
known to inhibit IL-1 signaling by binding to IL1R, which according to our results is
upregulated by IL-4. Polymorphisms in the IL1RN gene have been associated with various
inflammatory diseases, such as asthma and diabetes (54, 55). IL-9 is a Th2-type cytokine
and a candidate gene for asthma (56). It is involved in induction of many symptoms
associated with asthma and has been considered as a potential target for asthma therapy
(57). LGAL3 is involved in downregulation of the Th2 cytokine IL-5 and it has been used to
repress the symptoms of asthma in rat models (58, 59). Taken together, in addition to
antagonizing the effects of IL-12 and IL-4, in Th2 conditions TGFβ regulates expression of
various genes involved in inflammatory diseases. These genes are likely to play an
important role in maintaining the balance between Th1 and Th2 responses.

* * *

It is understood that the examples and embodiments described herein are for
illustrative purposes only and that various modifications or changes in light thereof will be
suggested to persons skilled in the art and are to be included within the spirit and purview
of this application and scope of the appended claims. All publications, patents, and patent
applications cited herein are hereby incorporated by reference in their entirety for all
purposes to the same extent as if each individual publication, patent or patent application
were specifically and individually indicated to be so incorporated by reference.
ACCESSION   SYMBOL     GENE NAME
            CNTN5      contactin 5
            S100P      S100 calcium binding protein P
            ABCD3      ATP-binding cassette, sub-family D (ALD), member
            UBL3       ubiquitin-like 3
            H2BFQ      H2B histone family, member Q
AB020630    PPP1R16B   protein phosphatase 1, regulatory (inhibitor) subunit
X76220      MAL        mal, T-cell differentiation protein
AB018356    SIAT9      sialyltransferase 9 (CMP-NeuAc:lactosylceramide alpha-2,3-sialyltransferase; GM3 synthase)
M15856      LPL        lipoprotein lipase
AF070533    OPTN       Optineurin
X59871      TCF7       transcription factor 7 (T-cell specific, HMG-box)
Z49194      POU2AF1    POU domain, class 2, associating factor 1
AL021154    ID3        inhibitor of DNA binding 3, dominant negative helix-loop-helix protein
AL049233               Homo sapiens mRNA; cDNA DKFZp564A023 (from clone DKFZp564A023)
AF097935    DSG1       Desmoglein 1
AB013382    DUSP6      dual specificity phosphatase 6
AF024714    AIM2       absent in melanoma 2
X83467      ABCD3      peroxisomal membrane protein-1
AL096713    FER1L3     fer-1-like 3, myoferlin
M97287      SATB1      special AT-rich sequence binding protein 1 (binds to nuclear matrix/scaffold-associating DNA's)
AB023230    KIAA1013   KIAA1013 protein
D13891      ID2        inhibitor of DNA binding 2, dominant negative helix-loop-helix protein
AB018293    KIAA0750   KIAA0750 gene product
AB011421    STK17B     serine/threonine kinase 17b (apoptosis-inducing)

Table IV. The human oligos used for quantitative Real-Time RT-PCR.

Oligos are listed for each gene as: 1) 5'-6(FAM)-PROBE-(TAMRA)-3'; 2) 5'-PRIMER 1-3'; 3) 5'-PRIMER 2-3'.

ACCESSION            GENE                       OLIGOS
AB013602, AB013382   DUSP6 (short & long)       1) 5'-CTCTACGACGAGAGCAGCAGCGACTG-3'
                                                2) 5'-GCTGTGGCACCGACACAGT-3'
                                                3) 5'-CTCGCCGCCCGTATTCT-3'
AB013382             DUSP6 (long)               1) 5'-CTCTACGACGAGAGCAGCAGCGACTG-3'
                                                2) 5'-GCTGTGGCACCGACACAGT-3'
                                                3) 5'-GAACTCGGCTTGGAACTTACTGAA-3'
X64318               E4BP4 (NFIL3)              1) 5'-TCCTCAGTAGAACACACGCAGGAGAGCTC-3'
                                                2) 5'-AGCTCGCTGTCCGATGTTTC-3'
                                                3) 5'-CTTCTGCAGCTTCCCTGCAC-3'
J04617               EF1α                       1) 5'-AGCGCCGGCTATGCCCCTG-3'
                                                2) 5'-CTGAACCATCCAGGCCAAAT-3'
                                                3) 5'-GCCGTGTGGCAATCCAAT-3'
AF078077             GADD45β (MyD118)           1) 5'-TGGCCACCTCCACCAAGCCG-3'
                                                2) 5'-CCGGCTTTCTTCGCAGTAG-3'
                                                3) 3'-CACGGACGCCTGGAAGA-5'
X55122               GATA-3                     1) 5'-TGCCGGAGGAGGTGGATGTGCT-3'
                                                2) 5'-GGACGCGGCGCAGTAC-3'
                                                3) 5'-TGCCTTGACCGTCGATGTTA-3'
M37265               IFNγ                       1) 5'-TGCTGGCGACAGTTCAGCCATCAC-3'
                                                2) 5'-CTCGAAACAGCATCTGACTCCTT-3'
                                                3) 5'-TGTCCAACGCAAAGCAATACA-3'
M97287               SATB1                      1) 5'-AACGAGCAGGAATCTCCCAGGCG-3'
                                                2) 5'-ACCAGTGGGTACGCGATGA-3'
                                                3) 5'-TGTTAAAAGCCACACGTGCAA-3'
AF241243             T-bet                      1) 5'-TCAGCATGAAGCCTGCATTCTTGCC-3'
                                                2) 5'-ACAGCTATGAGGCTGAGTTTCGA-3'
                                                3) 5'-GGCCTCGGTAGTAGGACATGGT-3'
AB000734             TIP3 (SSI-1, SOCS-1, JAB)  1) 5'-TTCGCACGCCGATTACCGGC-3'
                                                2) 5'-ACACGCACTTCCGCACATT-3'
                                                3) 5'-CTGGCGCGCGTGATG-3'

CLAIMS:

1. A method of identifying a compound capable of modulating the polarization of 
CD4+ lymphocytes, the method comprising: 

(a) contacting said compound with naive CD4+ lymphocytes; 

(b) inducing the polarization of the lymphocytes; 

(c) preparing a gene expression profile from the lymphocytes; 

(d) comparing the lymphocyte gene expression profile to a gene expression profile
derived from Table 1.

2. The method according to claim 1, wherein the induction of step (b) is 
performed by contacting the lymphocytes with a cytokine. 

3. The method according to claim 2, wherein said cytokine is IL-12 or IL-4. 

4. The method according to any one of the preceding claims, wherein a 
difference in the expression profiles of the target genes identifies a potential drug compound 
for the treatment of asthma or other immune-mediated diseases. 

5. A method of identifying a compound that modulates the expression of at least 
one gene listed in Table 2 or Table 6, the method comprising: 

(a) incubating a cell that can express a protein from said gene with a compound
under conditions and for a time sufficient for the cell to express the protein of
said gene, when the compound is not present;

(b) incubating a control cell under the same conditions and for the same time
without the compound;

(c) measuring expression of said gene in the cell in the presence of the compound; 

(d) measuring expression of said gene in the control cell; and 

(e) comparing the amount of expression of said gene in the presence and absence 
of the compound, wherein a difference in the level of expression indicates that 
the compound modulates the expression of said gene. 

6. A method of identifying a compound that modulates the activity of at least one 
gene listed in Table 2 or Table 6, the method comprising: 
(a) incubating a cell that has said activity with a compound under conditions and
for a time sufficient for the cell to express said activity, when the compound is 
not present; 

(b) incubating a control cell under the same conditions and for the same time 
without the compound; 

(c) measuring said activity in the cell in the presence of the compound; 

(d) measuring said activity in the control cell; and 

(e) comparing the amount of said activity in the presence and absence of the 
compound, wherein a difference in the level of expression indicates that the 
compound modulates the activity of said gene. 

7. A method of treating a patient with asthma or other immune-mediated disease, 
the method comprising administering to the patient a pharmaceutical composition, wherein 
the composition alters the expression or activity of at least one gene listed in Table 2 or Table 
6. 

8. The method according to claim 7, wherein the active compound of said 
pharmaceutical composition is identified by a method of any one of claims 1-6. 

9. The method according to claim 7, wherein the active compound of said 
pharmaceutical composition is an antibody binding to at least one gene product of the genes 
listed in Table 2 or Table 6. 

10. A pharmaceutical composition altering the expression or activity of at least
one gene listed in Table 2 or Table 6 for use in prophylaxis or treatment of asthma or other 
immune-mediated disease. 

11. The pharmaceutical composition according to claim 9, comprising an antibody
binding to at least one gene product of the genes listed in Table 2 or Table 6 as an active 
ingredient. 

12. A method of identifying a compound capable of modulating the polarization 
of murine CD4+ lymphocytes, the method comprising: 

(a) contacting said compound with naive murine CD4+ lymphocytes; 

(b) inducing the polarization of the lymphocytes; 

(c) preparing a gene expression profile from the lymphocytes; 
(d) comparing the lymphocyte gene expression profile to a gene expression profile
derived from Table 3.

13. A method of determining the activity of the compound identified in a method 
according to any one of claims 1-6 in a mouse model, the method comprising 
(a) contacting said compound with naive murine CD4+ lymphocytes;

(b) inducing the polarization of the lymphocytes;

(c) preparing a gene expression profile from the lymphocytes;

(d) comparing the lymphocyte gene expression profile to a gene expression profile
derived from Table 3.

14. A method according to any one of claims 1-4, wherein said gene expression
profile derived from Table 1 is at least partly based on the expression fold changes of any one
of the genes selected from the group consisting of: KIAA0053, LRRN3, CIG5, DUSP6,
FER1L3, S100P, SATB1, SLC11A2, STK17B, a gene identified by accession number
AI971169 and a gene identified by accession number AL432401.

15. A method according to any one of claims 5-11, wherein the gene or genes
is/are selected from the group consisting of: KIAA0053, LRRN3, CIG5, DUSP6, FER1L3,
S100P, SATB1, SLC11A2, STK17B, a gene identified by accession number AI971169 and a
gene identified by accession number AL432401.

16. A method according to any of claims 1-4, wherein said lymphocyte gene
expression profile and said gene expression profile derived from Table 1 each comprise at
least 5 of the same genes listed in Table 1.

17. The method of claim 16, wherein the lymphocyte gene expression profile and 
the gene expression profile derived from Table 1 each comprise at least 10 of the same genes 
listed in Table 1. 

18. The method of claim 17, wherein the lymphocyte gene expression profile and
the gene expression profile derived from Table 1 each comprise at least 25 of the same genes
listed in Table 1.

19. The method according to claim 5, wherein measuring steps (c) and (d) each 
comprise measuring the expression of at least 5 genes from those listed in Table 2 or Table 6 
and comparing comprises comparing the expression of these genes in both the cell and
control cell. 

20. The method according to claim 19, wherein the expression of at least 10 genes 
is measured in steps (c) and (d). 

21. The method according to claim 20, wherein the expression of at least 25 genes
is measured in steps (c) and (d).

22. A method according to any of claims 6-11, wherein measuring steps (c) and (d)
each comprise measuring the activity of at least 5 genes from those listed in Table 2 or Table
6 and comparing comprises comparing the activity of these genes in both the cell and control
cell.

23. The method according to claim 22, wherein the activity of at least 10 genes is 
measured in steps (c) and (d). 

24. The method according to claim 23, wherein the activity of at least 25 genes is 
measured in steps (c) and (d). 

25. A method for classifying a lymphocyte, comprising:

(a) providing a test sample derived from the lymphocyte, wherein the 
lymphocyte is capable of expressing one or more nucleic acid markers from the group 
consisting of those listed in Table 2 or Table 6; 

(b) determining the expression level of the one or more markers in the test
sample;

(c) comparing the expression level of the one or more markers in the test 
sample with the expression level of the same markers in a control sample, wherein the control 
sample is derived from a lymphocytic cell whose cellular status is known; and 

(d) classifying the lymphocyte on the basis of the comparison of step (c).

26. The method of claim 25, wherein the control sample is derived from a Th1 or
Th2 cell and classifying comprises identifying the lymphocyte as a Th1 or Th2 cell.

27. A method for diagnosing the presence of, or a predisposition to, an immune- 
related disorder in a subject, the method comprising: 
(a) determining the expression level of one or more nucleic acid markers in a
test sample obtained from the subject, wherein the one or more nucleic acid markers are 
selected from the group consisting of those listed in Table 2 or Table 6; 

(b) comparing the expression level of the one or more nucleic acid markers in
the test sample with the expression level of the same markers in a control sample whose
immune status is known; and

(c) diagnosing the presence or absence of the immune disorder in 
the subject, or a predisposition to the immune disorder, on the basis of the comparison of step 
(b). 

28. A method for identifying a compound that modulates differentiation of a
lymphocyte, the method comprising:

(a) contacting a test cell capable of expressing one or more gene markers listed 
in Table 2 or Table 6 with a test compound; and 

(b) determining the expression level of the one or more gene markers in the test
cell; and

(c) comparing the expression level of the one or more gene markers with the
expression level of the same markers for a control cell, wherein

the test cell and the control cell are lymphocytes and the cellular
state of the control cell is known; and

a difference in the expression level between the test and control
cell is an indicator that the test compound is a modulator of lymphocyte differentiation.

29. The method according to claim 28, wherein the expression level of at least 5 
gene markers are determined and compared. 

30. The method according to claim 29, wherein the expression level of at least 10
gene markers are determined and compared.

31. The method according to claim 30, wherein the expression level of at least 25
gene markers are determined and compared.
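
The comparing step recited in claims 1 and 16-18 can be illustrated with a short sketch. It is an illustration under stated assumptions, not the method of the application: the function name compare_profiles, the representation of a profile as a mapping from gene symbol to log2 fold change, the difference threshold, and every numeric value are hypothetical. Only the gene symbols are taken from claim 14.

```python
from typing import Dict, List, Tuple


def compare_profiles(
    test: Dict[str, float],
    reference: Dict[str, float],
    min_shared: int = 5,
    threshold: float = 1.0,
) -> Tuple[List[str], List[str]]:
    """Compare two profiles given as gene symbol -> log2 fold change.

    Returns the genes present in both profiles and, among those, the genes
    whose fold changes differ by more than `threshold` (an assumed cutoff).
    """
    shared = sorted(set(test) & set(reference))
    if len(shared) < min_shared:
        raise ValueError(
            f"profiles share {len(shared)} genes, fewer than the required {min_shared}"
        )
    differing = [g for g in shared if abs(test[g] - reference[g]) > threshold]
    return shared, differing


# Hypothetical log2 fold changes; these are NOT values from Table 1.
reference_profile = {
    "KIAA0053": 1.5, "LRRN3": -2.0, "CIG5": 2.5, "DUSP6": 1.0, "SATB1": -1.2,
}
# Hypothetical profile of lymphocytes polarized in the presence of a compound.
treated_profile = {
    "KIAA0053": 1.4, "LRRN3": -0.3, "CIG5": 2.6, "DUSP6": 1.1, "SATB1": 0.9,
    "S100P": 0.2,
}

shared, differing = compare_profiles(treated_profile, reference_profile)
print("shared genes:", shared)
print("genes with differing fold change:", differing)
```

The min_shared parameter mirrors the "at least 5", "at least 10" and "at least 25" thresholds of claims 16-18. How large a difference counts as meaningful is an experimental question that the sketch does not settle.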
Box No. II Observations where certain claims were found unsearchable (Continuation of item 2 of first sheet)



This international search report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 



1. [X] Claims Nos.: 7-9

because they relate to subject matter not required to be searched by this Authority, namely: 

Claims 7-9 relate to methods of treatment of the human or
animal body by surgery or by therapy or diagnostic methods
practiced on the human or animal body (PCT Rule 39.1(iv)).



2. Claims Nos.: 10-11 and 7-9 

because they relate to parts of the international application that do not comply with the prescribed requirements to such an 
extent that no meaningful international search can be carried out, specifically: 

see next sheet 



3. [ ] Claims Nos.:

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 
Box No. III Observations where unity of invention is lacking (Continuation of item 3 of first sheet)



This International Searching Authority found multiple inventions in this international application, as follows: 

see next sheet 



1. [ ] As all required additional search fees were timely paid by the applicant, this international search report covers all searchable
claims.

2. As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment of 
any additional fee. 

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this international search report covers
only those claims for which fees were paid, specifically claims Nos.:



4. [X] No required additional search fees were timely paid by the applicant. Consequently, this international search report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 1-31 all partially



Remark on Protest [ ] The additional search fees were accompanied by the applicant's protest.

[ ] No protest accompanied the payment of additional search fees.
Box No. IV Text of the abstract (Continuation of item 5 of the first sheet)



<Box II.2>

Present claims 10-11 and 7-9 relate to a product defined by reference to a desirable
characteristic or property, namely that the composition "alters the expression or activity of at
least one gene listed in Table 2 or Table 6". The claims cover all compounds having this
characteristic or property, whereas the application lacks support within the meaning of Article 6
PCT and disclosure within the meaning of Article 5 PCT for such products. No examples of
compounds are given in the description. In claim 11 it is stated that the compound is an
antibody, but no specific examples of such antibodies are provided in the description.
Additionally, previously known compounds may be included in the scope of the present claims.
In the present case, the claims so lack support, and the application so lacks disclosure, that a
meaningful search over the whole of the claimed scope is impossible. Independent of the above
reasoning, the claims also lack clarity (Article 6 PCT). An attempt is made to define the product
by reference to a result to be achieved. Again, this lack of clarity in the present case is such as to
render a meaningful search for claims 7-11 impossible.



<Box III> 

1: Claims: 1-31 (all partially) directed to a method of identifying a compound capable of 
modulating the polarization of CD4+ lymphocytes comprising comparing the expression of 
KIAA0053 in lymphocytes contacted with the compound and induced to polarize, methods of 
identifying a compound that modulates the expression or activity of KIAA0053, methods for 
classifying lymphocytes or diagnosing immune related disorders by measuring expression of 
KIAA0053 and further methods comprising the step of determining the expression level of 
KIAA0053 

2: Claims: 1-31 (all partially) directed to a method of identifying a compound capable of 
modulating the polarization of CD4+ lymphocytes comprising comparing the expression of 
LRRN3 in lymphocytes contacted with the compound and induced to polarize, methods of 
identifying a compound that modulates the expression or activity of LRRN3, methods for 
classifying lymphocytes or diagnosing immune related disorders by measuring expression of 
LRRN3 and further methods comprising the step of determining the expression level of LRRN3

3-11: Claims: 1-31 (all partially) directed to a method of identifying a compound capable of
modulating the polarization of CD4+ lymphocytes comprising comparing the expression of each
of the other nine genes mentioned in claims 14-15 in lymphocytes contacted with the compound 
and induced to polarize, methods of identifying a compound that modulates the expression or 
activity of each of the other nine genes mentioned in claims 14-15, methods for classifying 
lymphocytes or diagnosing immune related disorders by measuring expression of each of the 
other nine genes mentioned in claims 14-15 and further methods comprising the step of 
determining the expression level of each of the other nine genes mentioned in claims 14-15 

The searching authority could not find any same or corresponding technical feature in terms of a
common property or activity, and a shared structural element essential to such a common 
property or activity, among the genes listed in the application. No grouping of the genes has 
therefore been made, and each gene is so far considered to constitute one invention (see also the 
motivation below). 

The present application has been considered to contain 11 inventions which are not linked such
that they form a single general inventive concept, as required by Rules 13.1, 13.2 and 13.3 PCT 
for the following reasons: 

The prior art has been identified as: WO 01/88199 (D1).

The marker genes disclosed in D1 are identified by analyzing the changes in gene expression in
response to Th1-inducing or Th2-inducing conditions. Naive CD4+ cells are activated by contact
with CD3/CD28 and either IL-12 (Th1-inducing conditions) or IL-4 (Th2-inducing conditions).
RNA was isolated from samples taken at different times and RNA probes were used to probe
the Affymetrix human 6800 DNA microarray set. Genes with significant differences between
expression in Th1 and/or Th2 cells as compared to naive undifferentiated CD4+ T cells are
shown. Naive CD4+ T cells are cultured in the presence of anti-CD3 and anti-CD28.
D1 discloses a method for assessing whether Th1 or Th2 cells are present in a subject
comprising comparing the expression level of a marker gene in a sample from a subject and in a
control sample, methods for monitoring the differentiation of naive T cells into Th1 or Th2
cells, methods for assessing the efficacy of a test compound or therapy for modulating
differentiation of Th1 or Th2 cells in a subject, methods of selecting a composition for
modulating differentiation, growth or maturation of Th1 or Th2 cells and methods for assessing
the potential of a test compound to trigger the differentiation of Th1 or Th2 cells from naive T
cells, all methods involving measuring the expression of genes differentially expressed in Th1
and/or Th2 cells in a sample and a control. The marker genes can be used to diagnose
conditions associated with Th cells (e.g. asthma) by measuring the activity or expression of the
marker gene in a sample. D1 also provides methods for identifying compounds which have a
modulatory effect on the activity of the marker gene. See abstract, p. 4 line 10 - p. 8 line 20, p.
10 lines 20-36, p. 11 lines 25-37, p. 55 lines 15-24, p. 78-80, examples E-F, and the claims.

Thus, D1 shows genes being differentially expressed in CD4+ cells polarized under conditions
corresponding to conditions used in the present application for identifying genes included in Tables
1-3 and 6.

The genes mentioned in claims 14-15 are differentially expressed in Th1-induced vs. Th2-induced
cells (see Table 1); genes with such characteristics are shown in D1. Genes with an increased
expression in Th2 cells as compared to Th1 cells are shown in D1.

Genes differentially expressed within and/or among Th-cell subpopulations are also disclosed in
US 6414117 (D2). Murine as well as human genes are identified. The genes identified can be
used diagnostically or as targets for therapeutic intervention relating to immune disorders. The
genes are identified essentially as in D1. D2 provides methods for the identification of
compounds which modulate the expression of genes or the activity of gene products involved in
TH cell subpopulation-related disorders. See the abstract, col. 1 lines 16-49, col. 3 line 63 - col. 4
line 1, col. 4 lines 12-34 and col. 76 lines 49-62.

Hamalainen H et al., Genome Biol. 2001;2(7): (D3) also shows genes differentially expressed by
polarized Th1 and Th2 cells. The genes are identified using polarization conditions like the ones
used in the present application. See abstract, p. 5 right col. last paragraph and p. 9 left col.
paragraph 2.

Invention I: 

The special technical features which make a contribution over this prior art (Rule 13.2 PCT) are 
the following: the gene KIAA0053 is identified as being differentially expressed in CD4+ cells 
polarized under certain conditions. 

From these special technical features the objective problem to be solved by the first invention 
can be construed as: providing the gene marker KIAA0053, which is differentially expressed in 
CD4+ cells polarized under certain conditions. 

Inventions II-XI:

The special technical features of claims 1-31 (all partially) are the following: each of the other
nine marker genes mentioned in claims 14-15 is identified as being differentially expressed in
CD4+ cells polarized under certain conditions.

From these special technical features the objective problem to be solved by the first invention 
can be construed as: providing each of the other nine marker genes mentioned in claims 14-15, 
which is differentially expressed in CD4+ cells polarized under certain conditions. 

The above analysis shows that the special technical features of invention I (claim(s) 1-31 all 
partially) are neither the same as nor corresponding to those of inventions II-XI. 

Consequently, neither the objective problem underlying the subjects of the claimed inventions, 
nor their solutions defined by the special technical features allow for a relationship to be 
established between the said inventions, which involves a single general inventive concept. 

In conclusion, therefore, the XI groups of claims are not linked by common or corresponding 
special technical features and define different inventions not linked by a single general inventive 
concept. 

Hence, the application does not meet the requirements of unity of invention as defined in Rules
13.1 and 13.2 PCT.